AND USES THEREOF

Related Information 

This application claims priority to U.S. provisional Application No. 60/121,461,
entitled "A NOVEL ANDROGEN-INDUCED SUPPRESSOR OF CELL
PROLIFERATION AND USES THEREOF," filed on February 24, 1999, incorporated
herein in its entirety by this reference. The contents of all patents, patent applications, and
references cited herein are expressly incorporated by reference in their entireties.

Government Sponsored Research 

This work was supported in part by PHS NIH grant CA-55574. 

Background of the Invention 

Among men, carcinoma of the prostate is the second most common cancer and the
second most common cause of death from cancer in the United States. Each year over
130,000 men are diagnosed with prostate cancer and over 30,000 will die from the disease
((1992) MMWR 41:459). Typically, 61% of all deaths from prostate cancer occur within
five years of diagnosis and 88% within ten years (Smart (1997) Cancer 80:1835-1844).
Moreover, despite the availability of risk assessment tools, the optimal therapy for treating
prostate cancer remains controversial (Small (1998) Drugs Aging 13:71-81). For example,
although certain markers of prostate cancer progression such as prostate-specific antigen
(PSA) have proven valuable in the diagnosis and management of prostate cancer, as
currently used, PSA is insufficiently sensitive and specific for early detection or staging of
the malignancy (Daher et al. (1998) Clin. Chem. Lab. Med. 36:671-681).

In addition, in some patients with metastatic disease of the prostate, hormone therapy
(e.g., antiandrogen, estrogen, etc.) is frequently used. However, many patients on hormone
therapy develop hormone resistance and the management of hormone refractory disease is a
major clinical problem (Ismail et al. (1997) Tech Urol. 3:16-24). The death of patients
from prostate cancer is related to the development of clones of cells capable of multiplying
and metastasizing without androgen stimulation. To date, efforts to suppress these cells
have been of limited success (Newling (1996) Eur. Urol. Suppl. 2:69-74). This is in part
due to the fact that the initial events in the development of prostate cancer are not well
understood.

Normally, cell numbers in the prostate gland are regulated by androgens through
separate pathways that include a) inhibition of cell death (apoptosis), b) induction of cell
proliferation (Step-1), and c) inhibition of cell proliferation (Step-2, proliferation shutoff).
In normal tissue, the apoptotic and proliferative activities are minimal and, apparently,
Step-2 (inhibition of cell proliferation) maintains the integrity of the tissue. Prostate cancer
cells evolve when this circuitry fails in the initial or early phases of prostate cancer.

Summary of the Invention

A hope for managing prostate cancer lies in the earlier detection of the disease using 
improved diagnostic indicators, and developing markers that will allow for the more 
efficient and strategic use of hormone therapy, preferably in concert with improvements in 
the quality of life for patients with prostate cancer. 

To this end, a novel androgen-induced tumor suppressor gene termed "Androgen
Shutoff Gene 3" (AS3) has been discovered. This gene has a role in inhibiting cell
proliferation and is useful as a marker for the efficient diagnosis and treatment of prostate
cancer.

The present invention is based, at least in part, on the discovery of a novel
androgen-induced tumor suppressor, referred to herein as "Androgen Shutoff Gene 3" or "AS3"
nucleic acid and protein molecules. The AS3 molecules of the present invention are useful
as targets for developing modulating agents of cell proliferation, particularly cells of the
prostate. Accordingly, in one aspect, this invention provides isolated nucleic acid molecules
encoding AS3 proteins or biologically active portions thereof, as well as nucleic acid
fragments suitable as primers or hybridization probes for the detection of AS3-encoding
nucleic acids.

In one embodiment, an AS3 nucleic acid molecule of the invention is at least 50%,
55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or more identical to the nucleotide
sequence (e.g., to the entire length of the nucleotide sequence) shown in SEQ ID NO:1 or 3
or the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as
Accession Number , or a complement thereof.
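By way of illustration only, the percent identity thresholds recited above can be evaluated with a short calculation. The following is a minimal sketch, assuming the two sequences have already been aligned to equal length (with "-" marking gaps) and assuming identity is counted over the full alignment length, which is only one of several conventions in use. The function names and the toy sequences are hypothetical and are not taken from SEQ ID NO:1 or 3.

```python
# Illustrative sketch: percent identity of two pre-aligned nucleotide sequences.
# Assumption: identity = identical columns / total alignment columns, so any
# gapped column counts as a non-identical position.

THRESHOLDS = (50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 98)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Return percent identity over the full length of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


def highest_threshold_met(identity: float):
    """Return the largest recited threshold that the identity meets, or None."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None


if __name__ == "__main__":
    reference = "ACGTACGTAC"  # hypothetical toy sequence
    candidate = "ACGTACGTTC"  # differs at one position
    identity = percent_identity(reference, candidate)
    print(identity, highest_threshold_met(identity))
```

In practice the alignment itself would be produced by a dedicated alignment program; the sketch only covers the counting step once an alignment is in hand.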

In a preferred embodiment, the isolated nucleic acid molecule includes the
nucleotide sequence shown in SEQ ID NO:1 or 3, or a complement thereof. In another
embodiment, the nucleic acid molecule includes SEQ ID NO:3 and nucleotides 66-4238 of
SEQ ID NO:1.

In another embodiment, an AS3 nucleic acid molecule includes a nucleotide
sequence encoding a protein having an amino acid sequence sufficiently homologous to the
amino acid sequence of SEQ ID NO:2 or an amino acid sequence encoded by the DNA
insert of the plasmid deposited with ATCC as Accession Number . In a preferred
embodiment, an AS3 nucleic acid molecule includes a nucleotide sequence encoding a
protein having an amino acid sequence at least 45%, 50%, 55%, 60%, 65%, 70%, 75%,
80%, 85%, 90%, 95%, 98% or more homologous to the entire length of the amino acid
sequence of SEQ ID NO:2 or the amino acid sequence encoded by the DNA insert of the
plasmid deposited with ATCC as Accession Number . In a related embodiment, the
nucleic acid encodes a polypeptide fragment having at least 1391 amino acid residues of the
amino acid sequence shown in SEQ ID NO:2.
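As a consistency check on the coordinates recited above, the span of nucleotides 66-4238 of SEQ ID NO:1 can be compared with the 1391-residue length of SEQ ID NO:2. The following minimal sketch uses only those two figures; the variable names are illustrative. The result, a span of exactly 1391 codons, would imply that the recited range does not include a stop codon, although that inference is an assumption rather than a statement made in this specification.

```python
# Illustrative arithmetic: does the recited coding span match the protein length?
CDS_START = 66         # first nucleotide of the recited span in SEQ ID NO:1
CDS_END = 4238         # last nucleotide of the recited span in SEQ ID NO:1
PROTEIN_LENGTH = 1391  # residues in SEQ ID NO:2

# Coordinates are 1-based and inclusive, hence the +1.
cds_length = CDS_END - CDS_START + 1
codons, remainder = divmod(cds_length, 3)

print(f"span length: {cds_length} nt = {codons} codons, remainder {remainder}")
print("matches protein length:", codons == PROTEIN_LENGTH and remainder == 0)
```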

In another preferred embodiment, an isolated nucleic acid molecule encodes the 
amino acid sequence of human AS3. In yet another preferred embodiment, the nucleic acid 
molecule includes a nucleotide sequence encoding a protein having the amino acid sequence 
of SEQ ID NO:2 or the amino acid sequence encoded by the DNA insert of the plasmid 
deposited with ATCC as Accession Number . 

Another embodiment of the invention features nucleic acid molecules, preferably
AS3 nucleic acid molecules, which specifically detect AS3 nucleic acid molecules relative
to nucleic acid molecules encoding non-AS3 proteins. For example, in one embodiment,
such a nucleic acid molecule is at least 100-250, 250-300, 300-350, 350-400, 400-450,
450-500, 500-550, or 550-600 or more nucleotides in length and hybridizes under stringent
conditions to a nucleic acid molecule comprising the nucleotide sequence shown in SEQ ID
NO:1, the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as
Accession Number , or a complement thereof. In a related embodiment, the nucleic
acid molecule can further contain a nucleotide sequence encoding a heterologous
polypeptide.

In a related embodiment, the nucleic acid molecules are at least 15 (e.g., contiguous)
nucleotides in length and hybridize under stringent conditions to nucleotides 1-5253 of SEQ
ID NO:1.

In other preferred embodiments, the nucleic acid molecule encodes a naturally
occurring allelic variant of a polypeptide comprising the amino acid sequence of SEQ ID
NO:2 or an amino acid sequence encoded by the DNA insert of the plasmid deposited with
ATCC as Accession Number , wherein the nucleic acid molecule hybridizes to a
nucleic acid molecule comprising SEQ ID NO:1 or 3 under stringent conditions.

Another embodiment of the invention provides an isolated nucleic acid molecule 
which is antisense to an AS3 nucleic acid molecule, e.g., the coding strand of an AS3 
nucleic acid molecule. 

Another aspect of the invention provides a vector comprising an AS3 nucleic acid
molecule. In certain embodiments, the vector is a recombinant expression vector. In
another embodiment, the invention provides a host cell containing a vector of the invention.
In yet another embodiment, the invention provides a host cell containing a nucleic acid
molecule of the invention. The invention also provides a method for producing a protein,
preferably an AS3 protein, by culturing in a suitable medium, a host cell, e.g., a mammalian
host cell such as a non-human mammalian cell, of the invention containing a recombinant
expression vector, such that the protein is produced.

Another aspect of this invention features isolated or recombinant AS3 proteins and
polypeptides. In one embodiment, the isolated protein, preferably an AS3 protein, has an
amino acid sequence at least about 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%,
95%, 98% or more homologous to the amino acid sequence of SEQ ID NO:2 or the amino
acid sequence encoded by the DNA insert of the plasmid deposited with ATCC as
Accession Number .

In another embodiment, the invention features fragments of the AS3 protein having
the amino acid sequence of SEQ ID NO:2, wherein the fragment comprises at least 15
amino acids (e.g., contiguous amino acids) of the amino acid sequence of SEQ ID NO:2 or
an amino acid sequence encoded by the DNA insert of the plasmid deposited with the ATCC
as Accession Number . In another embodiment, the protein, preferably an AS3
protein, has the amino acid sequence of SEQ ID NO:2.

In another embodiment, the invention features an isolated protein, preferably an AS3
protein, which is encoded by a nucleic acid molecule consisting of a nucleotide sequence at
least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or more
homologous to a nucleotide sequence of SEQ ID NO:1 or 3, or a complement thereof. This
invention further features an isolated protein, preferably an AS3 protein, which is encoded
by a nucleic acid molecule consisting of a nucleotide sequence which hybridizes under
stringent hybridization conditions to a nucleic acid molecule comprising the nucleotide
sequence of SEQ ID NO:1 or 3, or a complement thereof.

The proteins of the present invention or portions thereof, e.g., biologically active
portions thereof, can be operatively linked to a non-AS3 polypeptide (e.g., heterologous
amino acid sequences) to form fusion proteins. The invention further features antibodies,
such as monoclonal or polyclonal antibodies, that specifically bind proteins of the invention,
preferably AS3 proteins.

In another aspect, the present invention provides a method for detecting the presence
of an AS3 polypeptide in a biological sample by contacting the biological sample with a
compound capable of detecting an AS3 polypeptide. In one embodiment of this invention,
the compound is an antibody. In another embodiment of this invention, a kit is featured that
contains a compound that selectively binds to the polypeptide and instructions for use.

In another aspect, the present invention features a method for detecting the presence
of an AS3 nucleic acid in a biological sample by contacting the biological sample with a
nucleic acid probe or primer that selectively hybridizes to an AS3 molecule of the invention
and indicating the presence of such a molecule. In other embodiments of this invention, the
nucleic acid in the biological sample is RNA. In another embodiment of the invention, a kit
is provided comprising at least one reagent that binds a nucleic acid and instructions for use.

In another aspect, the invention provides for a method for identifying a compound 
that binds to an AS3 polypeptide or fragment by detecting direct binding of the compound, 
binding using a competition assay, or binding using an AS3 activity. 

In yet another aspect, the invention provides a method for identifying a compound
that modulates the activity of an AS3 polypeptide or fragment by contacting such a
polypeptide or cell expressing such a polypeptide with a test compound and determining the
effect of the test compound on the activity of the polypeptide.

In another aspect, the invention provides a method for modulating AS3 activity
comprising contacting a cell capable of expressing AS3 with a compound that modulates
AS3 activity such that AS3 activity in the cell is modulated. In one embodiment, the agent
inhibits AS3 activity. In another embodiment, the agent stimulates AS3 activity. In one
embodiment, the compound modulates expression of AS3 by modulating transcription of an
AS3 gene or translation of an AS3 mRNA.

In another aspect, the invention features a transgenic animal generated from a cell
genetically engineered to lack nucleic acid encoding an AS3 polypeptide, where the
transgenic animal lacks expression of the AS3 polypeptide.

In a related aspect, the invention features a transgenic animal generated from a cell
that contains a substantially pure nucleic acid encoding an AS3 polypeptide, where the
nucleic acid is expressed in the transgenic animal.

In another aspect, the invention features a method of identifying a compound that
modulates cell proliferation. The method includes: (a) providing a cell that has an AS3
gene; (b) contacting the cell with a candidate compound; and (c) monitoring expression of
the AS3 gene, where an alteration in the level of expression of the AS3 gene indicates the
presence of a compound which modulates cell proliferation. In one preferred embodiment
of this aspect, the alteration that is an increase indicates the compound is inhibiting cell
proliferation, and the alteration that is a decrease indicates the compound is increasing cell
proliferation.
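The interpretation rule in step (c) can be sketched as a simple decision function. The following is illustrative only: it assumes AS3 expression has been quantified as a positive number before and after exposure to the candidate compound, and it introduces a hypothetical fold-change cutoff (min_fold) that is not specified anywhere in this description.

```python
# Illustrative sketch of the step (c) readout. The function name, the
# min_fold cutoff, and the returned labels are hypothetical.

def interpret_as3_change(baseline: float, treated: float, min_fold: float = 1.5) -> str:
    """Classify a candidate compound from AS3 expression before and after treatment."""
    if baseline <= 0 or treated < 0:
        raise ValueError("expression values must be non-negative, baseline positive")
    if min_fold <= 1.0:
        raise ValueError("min_fold must be greater than 1")
    ratio = treated / baseline
    if ratio >= min_fold:
        # Increased AS3 expression: compound is read as inhibiting proliferation.
        return "inhibits proliferation"
    if ratio <= 1.0 / min_fold:
        # Decreased AS3 expression: compound is read as increasing proliferation.
        return "increases proliferation"
    return "no call"


if __name__ == "__main__":
    print(interpret_as3_change(1.0, 3.0))
    print(interpret_as3_change(2.0, 0.5))
    print(interpret_as3_change(1.0, 1.1))
```

The "no call" band around a ratio of 1 is a design choice of the sketch, reflecting that small changes in measured expression would not ordinarily be treated as an alteration.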

In a related aspect, the invention features another method of identifying a compound
that is able to modulate cell proliferation that includes: (a) providing a cell including a
reporter gene operably linked to a promoter from an AS3 gene; (b) contacting the cell with a
candidate compound; and (c) measuring expression of the reporter gene, where a change in
the expression in response to the candidate compound identifies a compound that is able to
modulate cell proliferation. In one preferred embodiment of this aspect, the alteration that is
an increase indicates the compound is inhibiting cell proliferation.

In another aspect, the invention features a method of inhibiting the proliferation of a
cell by administering an amount of AS3 polypeptide or fragment thereof sufficient to inhibit
cell proliferation.

In related aspects, the invention includes methods of decreasing cell proliferation by
either providing a transgene encoding an AS3 polypeptide or fragment thereof to a cell of an
animal such that the transgene is positioned for expression in the cell; or by administering to
the cell a compound which increases AS3 biological activity in a cell. In preferred
embodiments, AS3 is from a mammal, the cell being treated is in a mammal, and the
mammal has been diagnosed with a condition involving cell proliferation such as cancer
(e.g., prostate cancer).

In two other aspects, the invention features methods of diagnosing a mammal for the
presence of disease involving altered cell proliferation or an increased likelihood of
developing a disease involving altered cell proliferation. The methods include isolating a
sample of nucleic acid from the mammal and determining whether the nucleic acid includes
an AS3 mutation, where the presence of a mutation is an indication that the animal has a cell
proliferation disease or an increased likelihood of developing a disease involving cell
proliferation; or measuring AS3 gene expression in a sample from an animal to be
diagnosed, where an alteration in the expression or activity relative to a sample from an
unaffected mammal is an indication that the mammal has a disease involving cell
proliferation or increased likelihood of developing such a disease. In preferred
embodiments, AS3 gene expression is measured by assaying the amount of AS3 polypeptide
or AS3 biological activity in the sample (e.g., the AS3 polypeptide is measured by
immunological methods), or AS3 gene expression is measured by assaying the amount of
AS3 RNA in the sample. In other preferred embodiments, the mammal is a human and the
method may be performed after or during hormone therapy (e.g., androgen therapy).

In another aspect, the invention features a kit for diagnosing a mammal for the
presence of a disease involving altered cell proliferation or an increased likelihood of
developing a disease involving altered cell proliferation that includes a substantially pure
antibody that specifically binds an AS3 polypeptide. Another such kit includes material for
measuring AS3 RNA (e.g., a probe). In a preferred embodiment, the material is a nucleic
acid probe.

In yet another aspect, the invention features a method of obtaining an AS3
polypeptide, including: (a) providing a cell with DNA encoding an AS3 polypeptide, the
DNA being positioned for expression in the cell; (b) culturing the cell under conditions for
expressing the DNA; and (c) isolating the AS3 polypeptide.

In a related aspect, the invention features a method of isolating an AS3 gene or
portion thereof having sequence identity to human AS3. The method includes amplifying
by polymerase chain reaction the AS3 gene or portion thereof using oligonucleotide primers
wherein the primers (a) are each greater than 15 nucleotides in length; (b) each have regions
of complementarity to opposite DNA strands in a region of the nucleotide sequence
provided in SEQ ID NO:1; and (c) optionally contain sequences capable of producing
restriction endonuclease cut sites in the amplified product; and isolating the AS3 gene or
portion thereof.
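Primer criteria (a) and (b) above can be checked computationally. The following minimal sketch assumes exact matching of each primer to the template, ignores optional 5' restriction-site tails (criterion (c)), and uses a hypothetical toy template rather than SEQ ID NO:1. All function names are illustrative.

```python
# Illustrative check of a PCR primer pair against a template strand.
# Assumptions: exact matches only; template is given 5'->3' as the top strand;
# the forward primer matches the top strand and the reverse primer matches
# the bottom strand (so it is complementary to the top strand).

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def check_primer_pair(template: str, forward: str, reverse: str, min_len: int = 16):
    """Return the amplicon length if the pair satisfies (a) and (b), else None.

    min_len=16 encodes "greater than 15 nucleotides in length".
    """
    template, forward, reverse = template.upper(), forward.upper(), reverse.upper()
    if len(forward) < min_len or len(reverse) < min_len:
        return None  # criterion (a) not met
    fwd_start = template.find(forward)
    rev_site = template.find(reverse_complement(reverse))
    if fwd_start == -1 or rev_site == -1:
        return None  # a primer does not anneal to its intended strand
    if rev_site < fwd_start:
        return None  # primers do not face each other
    return rev_site + len(reverse) - fwd_start


if __name__ == "__main__":
    fwd = "ATGGCTAGCAAGGATCCG"            # hypothetical 18-mer
    tail = "CGATTACGCTAGGCTAAC"           # hypothetical 18-mer binding region
    toy_template = fwd + "TTTTTTTTTT" + tail
    rev = reverse_complement(tail)
    print(check_primer_pair(toy_template, fwd, rev))
```

Real primer design would also weigh melting temperature, secondary structure, and mismatch tolerance, none of which the sketch attempts.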

In another aspect of the invention, the invention features a method for detecting if a
subject is at increased risk for developing prostate cancer including the steps of (a) detecting
the presence of an AS3 nucleic acid or polypeptide and (b) observing whether or not a
subject has reduced or absent AS3 levels as compared to a standard, e.g., a normal
age-matched control, wherein said reduced or absent AS3 levels indicate that the subject is at an
increased risk for developing prostate cancer. In a related embodiment, the invention
features a kit that contains at least one reagent for detecting the presence of an AS3
molecule.

In another aspect, the invention provides a method of prognosis for prostate cancer
by obtaining a biological sample from the subject, measuring AS3 levels, correlating those
levels with a control, and determining a prognosis based on whether the subject's AS3 levels
are above average or below average. In related embodiments, the method may be performed
during or after hormone therapy (e.g., androgen therapy) and employs an antibody or nucleic
acid probe to an AS3 molecule.

In yet another aspect, the invention features a method for the treatment of prostate
cancer comprising identifying a subject with prostate cancer or about to have prostate
cancer, administering a hormone therapy, and determining if the subject exhibits a change in
AS3 levels. In preferred embodiments, the invention provides a method for identifying
subjects that exhibit increased AS3 levels after receiving hormone therapy as responsive to
hormone therapy and, further, as candidates for intermittent hormone therapy. In other
preferred embodiments, the hormone therapy is an androgen therapy, the subject is a human,
and the measuring of AS3 nucleic acid or polypeptide levels is performed with an antibody
or nucleic acid probe or primer.

Other features and advantages of the invention will be apparent from the following
detailed description and claims.

Brief Description of the Drawings

Figure 1 depicts the cDNA sequence and predicted amino acid sequence of human
AS3. The nucleotide sequence corresponds to 1 to 5253 of SEQ ID NO:1. The amino acid
sequence corresponds to amino acids 1 to 1391 of SEQ ID NO:2. The coding region
without the 5' and 3' untranslated regions of the human AS3 gene is shown in SEQ ID
NO:3. Numbers on the left indicate positions in base pairs. The amino acid sequence of the
open reading frame is depicted under the coding strand. Numbers on the right indicate amino
acid positions. Destabilizing signals found in the untranslated regions of AS3 are underlined
and the polyadenylation signal and cleavage site are at base pair positions 5228-5233 and
5249-5253, respectively.

Figure 2 depicts the N-terminal leucine repeat structure of the AS3 polypeptide.
Numbers above the AS3 sequence indicate the positions of the blocks where uninterrupted
leucine (or isoleucine, valine) heptads occur.

Figure 3 depicts sequence comparisons of the putative Mg-nucleotide triphosphate
binding subdomains of AS3 with corresponding subdomains of various protein kinases. The
boxes represent the Hanks' conserved subdomains, as indicated above each box. The top
lines within the boxes show the consensus β-strand, loop, and α-helical secondary structure
elements. The numbers in the second lines indicate the positions of the corresponding
conformations in the AS3 sequence. The actual AS3 motifs are shown in the third line.
Hanks' conserved subdomains from protein kinases of close similarity are represented in the
lines below the AS3 sequence. The names of the kinases are indicated in parentheses.
Identical residues are highlighted. In the Mg-ATP binding loop, the "x" and lower case
letters indicate non-conserved amino acids.

Figure 4 depicts the genomic, cosmid, and exon maps of the AS3 cDNA. The
chromosomal panel represents a 1 megabase (Mb) genomic region around BRCA2. Boxes
with CG numbers are genomic areas where expression of transcripts was detected. The
centromere is at the left. The P1 artificial chromosome (PAC) PAC26H23 (Accession No.:
Z84467) overlaps with cosmid 267p19 (Accession No.: Z75889), which, in turn, overlaps
with PAC49J10 (Accession No.: Z84572). Numbers below the PAC and cosmid lines
indicate positions within the genomic clone. The scale above the exon map indicates the
genomic distance in thousands of base pairs. In the exon panel, black boxes represent the
exons, while the numbers above them indicate exon numbers. In the mRNA panel, the
numbers indicate nucleotide positions.

Figure 5 depicts the genomic and cDNA positions of exons in the AS3 transcript.
Asterisks represent the exon-intron boundaries. The area between asterisks represents the
exons. Exon sequences are in upper case; the numbers represent cDNA positions. Lower
case letters are intron sequences. Numbers of the first exon indicate positions in
PAC26H23. Numbers in parentheses refer to positions on cosmid 267p19, while numbers in
brackets refer to PAC49J10 positions.

Figure 6 depicts the cDNA sequence and predicted amino acid sequence of human
AS3 having an additional 84 base pairs of untranslated 5' sequence as compared to the
sequence presented in Fig. 1. The nucleotide sequence corresponds to 1 to 5337 of SEQ ID
NO:4. The amino acid sequence shown corresponds to amino acids 1 to 1391 of SEQ ID
NO:2. The coding region without the 5' and 3' untranslated regions of the human AS3
gene is shown in SEQ ID NO:3. Numbers on the left indicate positions in base pairs. The
amino acid sequence of the open reading frame is depicted under the coding strand.
Numbers on the right indicate amino acid positions. Destabilizing signals found in the
untranslated regions of AS3 are underlined and the polyadenylation signal and cleavage
signal are at base pair positions 5312-5317 and 5333-5337, respectively.
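The positions given for Figure 6 follow from those of Figure 1 by adding the 84 base pairs of extra 5' untranslated sequence. The following minimal sketch reproduces that shift from the figures stated in the two descriptions; the dictionary keys and function name are illustrative labels only.

```python
# Illustrative check: Figure 6 coordinates = Figure 1 coordinates + 84 bp.
UTR_EXTENSION = 84  # additional 5' untranslated base pairs in Figure 6

# 1-based inclusive positions stated for Figure 1 (SEQ ID NO:1).
FIG1_FEATURES = {
    "polyadenylation_signal": (5228, 5233),
    "cleavage_site": (5249, 5253),
}
FIG1_CDNA_LENGTH = 5253


def shift_feature(span, offset):
    """Shift a (start, end) span downstream by `offset` base pairs."""
    start, end = span
    return (start + offset, end + offset)


FIG6_FEATURES = {
    name: shift_feature(span, UTR_EXTENSION) for name, span in FIG1_FEATURES.items()
}
FIG6_CDNA_LENGTH = FIG1_CDNA_LENGTH + UTR_EXTENSION

if __name__ == "__main__":
    print(FIG6_CDNA_LENGTH)
    for name, span in FIG6_FEATURES.items():
        print(name, span)
```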

Detailed Description of the Invention

The present invention is based, at least in part, on the discovery of a novel molecule
referred to herein as "Androgen Shutoff Gene 3" or "AS3" nucleic acid and protein
molecules, which play a role in hormone-induced inhibition of cell proliferation. In one
embodiment, the AS3 molecules are capable of modulating cell proliferation, e.g., in cancer.
In a preferred embodiment, the AS3 molecules are expressed in cells of the prostate and/or
function in the cells of the prostate. In another preferred embodiment, the AS3 molecules
are expressed in prostate cells when exposed to a hormone, e.g., an androgen, and inhibit
cell proliferation.

The AS3 gene was cloned from a subtracted library made from a human prostate
carcinoma cell line induced to undergo growth arrest using androgen.

Androgens regulate prostate cell numbers and cell proliferation by three major
mechanisms: a) inhibition of cell death (apoptosis), b) induction of cell proliferation
(Step-1), and c) inhibition of cell proliferation (proliferative shutoff, Step-2) (Isaacs (1985)
Prostate 5:545-557; Bruchovsky et al. (1975) Vit. & Horm. 33:61-102; Sonnenschein et al.,
(1989) Cancer Res. 49:3474-3481). Androgens affect epithelial and stromal cell types
which, in turn, interact in the prostate (Hayward et al., (1997) Brit. J. Urol. Suppl. 2:18-26).
The human prostate LNCaP-FGC cell line that exhibits hormone responsiveness and is used
extensively for endocrine and molecular studies (Sonnenschein et al., (1989) Cancer Res.
49:3474-3481; Horoszewicz et al., (1983) Cancer Res. 43:1809-1818; Soto et al., (1995)
Oncology Res. 7:545-558) was employed herein. Proliferation is inhibited in these cells by
sex steroid-stripped (charcoal-dextran treated) human serum (CDHuS) (Sonnenschein et al.,
(1989) Cancer Res. 49:3474-3481). Low androgen concentrations cancel this inhibition
(Step-1) and at higher levels, androgens induce an irreversible proliferative shutoff (Step-2)
(Sonnenschein et al., (1989) Cancer Res. 49:3474-3481; Soto et al., (1995) Oncology Res.
7:545-558). During the shutoff period, these cells remain in the G0/G1 phase of the cell
cycle. Prostate specific antigen (PSA) induction, however, is still dependent on androgens
in these postmitotic cells (Soto et al., (1995) Oncology Res. 7:545-558).

To dissect androgen-mediated cell-cycle events, several androgen target cell lines
that express only one of the steps of the androgen regulated proliferative response were
generated. Two LNCaP variants were isolated: the LNCaP-TAC variant, which expresses
Step-1 only, and the LNCaP-TJA variant, which is resistant to the inhibitory effect of both
CD serum and androgens (Soto et al., (1995) Oncology Res. 7:545-558). LNCaP-LNO
cells, established by Horoszewicz et al., proliferate maximally in the presence of CDHuS,
express an androgen-induced proliferative shutoff, and undergo G0/G1 arrest (Step-2) at high
androgen concentrations (Horoszewicz et al., (1983) Cancer Res. 43:1809-1818; Soto et al.,
(1995) Oncology Res. 7:545-558). In addition to these human prostate cells, a new model
to demonstrate the shutoff effect in another cell type was developed by stable transfection of
a wild type androgen-receptor construct into breast carcinoma MCF-7 cells. These
MCF7-AR1 cells are also able to evoke a proliferative shutoff in response to androgens (Szelei et
al., (1997) Endocrinology 138:1406-1412).

Using a differential subtractive amplification procedure, a set of genes induced in the
proliferative shutoff response (Step-2) (Wang et al., (1991) Proc. Natl. Acad. Sci. USA
88:11505-11509) was identified. In particular, a gene involved in this regulation, AS3
(androgen shutoff gene 3), that shows high expression in the early regulatory phase of
androgen-induced proliferative shutoff in cell culture and in the prostates of castrated rats
was identified.

The AS3 gene encodes a polypeptide of 1391 residues with a molecular weight of
about 186 kD. It has coiled-coil structures that usually participate in protein-protein
interactions, a perfect leucine-zipper that suggests DNA binding, nuclear localization motifs,
proline- and serine-rich domains, unique C-terminal acidic-basic repeats, and ATP- and
DNA-binding motifs.

The transcript has 34 exons in a 200,000 bp region on chromosome 13q12-q13,
downstream of the breast cancer susceptibility gene BRCA2, and centromeric to the
retinoblastoma (Rb1) locus. This area is subject to frequent allelic losses in cancers, and is
believed to carry a number of cryptic suppressor genes.

The AS3 gene is involved in the regulation of androgen-induced proliferative arrest
in human prostate cells. Accordingly, the AS3 molecules described below serve as useful
diagnostic markers or therapeutic agents to control conditions of aberrant cell proliferation,
such as cancer (e.g., cancer of the prostate).

In one embodiment, the present invention is directed at human AS3; however, AS3
family members are also intended to be within the scope of the invention. The term
"family" when referring to the protein and nucleic acid molecules of the invention is
intended to mean two or more proteins or nucleic acid molecules having a common
structural domain or motif and having sufficient amino acid or nucleotide sequence
homology as defined herein. Such family members can be naturally or non-naturally
occurring and can be from either the same or different species. For example, a family can
contain a first protein of human origin, as well as other, distinct proteins of human origin or,
alternatively, can contain homologues of non-human origin. Members of a family may also
have common functional characteristics.

To identify the presence of important domains in a given polypeptide, and make the
determination that a protein of interest has a particular profile, the amino acid sequence of
the protein can be searched against several databases as described in Example 3. Using
these tools, a number of important domains within the AS3 polypeptide have been identified
and these results are set forth in Figures 2 and 3 and further described in Example 3.

Isolated proteins of the present invention, preferably AS3 proteins, have an amino acid
sequence sufficiently homologous to the amino acid sequence of SEQ ID NO:2 or are encoded
by a nucleotide sequence sufficiently homologous to SEQ ID NO:1 or 3. As used herein, the
term "sufficiently homologous" refers to a first amino acid or nucleotide sequence which
contains a sufficient or minimum number of identical or equivalent (e.g., an amino acid residue
which has a similar side chain) amino acid residues or nucleotides to a second amino acid or
nucleotide sequence such that the first and second amino acid or nucleotide sequences share
common structural domains or motifs and/or a common functional activity. For example,
amino acid or nucleotide sequences which share common structural domains have at least 30%,
40%, or 50% homology, preferably 60% homology, more preferably 70%-80%, and even more
preferably 90-95% homology across the amino acid sequences of the domains and contain at
least one and preferably two structural domains or motifs, are defined herein as sufficiently
homologous. Furthermore, amino acid or nucleotide sequences which share at least 30%, 40%,
or 50%, preferably 60%, more preferably 70-80%, or 90-95% homology and share a common
functional activity are defined herein as sufficiently homologous.

As used interchangeably herein, an "AS3 activity", "biological activity of AS3" or
"functional activity of AS3", refers to an activity exerted by an AS3 protein, polypeptide or
nucleic acid molecule on an AS3 responsive cell or on an AS3 protein substrate, as
determined in vivo, or in vitro, according to standard techniques. In one embodiment, an
AS3 activity is a direct activity, such as an association with an AS3-target molecule. As
used herein, a "target molecule" or "binding partner" is a molecule with which an AS3
protein binds or interacts in nature, such that AS3-mediated function is achieved. An AS3
target molecule can be a non-AS3 molecule or an AS3 protein or polypeptide of the present
invention. Alternatively, an AS3 activity is an indirect activity, such as modulating cell
cycle events. Preferably, an AS3 activity is the ability to modulate androgen-mediated cell
proliferation.

Accordingly, another embodiment of the invention features isolated AS3 proteins
and polypeptides having an AS3 activity. Preferred proteins are AS3 proteins encoded by a
nucleic acid molecule having a nucleotide sequence which hybridizes under stringent
hybridization conditions to a nucleic acid molecule comprising the nucleotide sequence of
SEQ ID NO:1 or 3.

The nucleotide sequence of the isolated human AS3 cDNA and the predicted amino
acid sequence of the human AS3 polypeptide are shown in Figure 1 and in SEQ ID NOs:1
and 2, respectively. A plasmid containing the nucleotide sequence encoding human AS3
was deposited with the American Type Culture Collection (ATCC), 10801 University
Boulevard, Manassas, VA 20110-2209, on ______ and assigned Accession Number ______.
This deposit will be maintained under the terms of the Budapest Treaty on the International
Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure. This
deposit was made merely as a convenience for those of skill in the art and is not an
admission that a deposit is required under 35 U.S.C. §112.

The human AS3 gene, which is approximately 5253 nucleotides in length, encodes a
protein having a molecular weight of approximately 186 kD and which is approximately
1391 amino acid residues in length.

Various aspects of the invention are described in further detail in the following
subsections:

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that encode
AS3 proteins or biologically active portions thereof, as well as nucleic acid fragments
sufficient for use as hybridization probes to identify AS3-encoding nucleic acid molecules
(e.g., AS3 mRNA) and fragments for use as PCR primers for the amplification or mutation
of AS3 nucleic acid molecules. As used herein, the term "nucleic acid molecule" is intended
to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g.,
mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The nucleic
acid molecule can be single-stranded or double-stranded, but preferably is double-stranded
DNA.

The term "isolated nucleic acid molecule" includes nucleic acid molecules which are
separated from other nucleic acid molecules which are present in the natural source of the
nucleic acid. For example, with regard to genomic DNA, the term "isolated" includes
nucleic acid molecules which are separated from the chromosome with which the genomic
DNA is naturally associated. Preferably, an "isolated" nucleic acid is free of sequences
which naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' ends of the
nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived.
For example, in various embodiments, the isolated AS3 nucleic acid molecule can contain
less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which
naturally flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic
acid is derived (see Fig. 4). Moreover, an "isolated" nucleic acid molecule, such as a cDNA
molecule, can be substantially free of other cellular material, or culture medium when
produced by recombinant techniques, or substantially free of chemical precursors or other
chemicals when chemically synthesized.

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule
having the nucleotide sequence of SEQ ID NO:1 or 3, or the nucleotide sequence of the
DNA insert of the plasmid deposited with ATCC as Accession Number ______, or a portion
thereof, can be isolated using standard molecular biology techniques and the sequence
information provided herein. Using all or a portion of the nucleic acid sequence of SEQ ID
NO:1 or 3, or the nucleotide sequence of the DNA insert of the plasmid deposited with
ATCC as Accession Number ______, as a hybridization probe, AS3 nucleic acid molecules
can be isolated using standard hybridization and cloning techniques (e.g., as described in
Sambrook, J., Fritsch, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual,
2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY, 1989). In a related embodiment, the invention features an AS3 nucleic
acid molecule having the sequence of SEQ ID NO:4 (see Fig. 6) which is identical to the
AS3 sequence provided in SEQ ID NO:1 (see Fig. 1) except for an additional 84 base pairs
at the 5' end of the molecule. One skilled in the art would recognize that this additional
untranslated 5' sequence may indicate that an alternative start site for the beginning of
transcription of the AS3 mRNA molecule may exist.

Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID NO:1 or
3, or the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as
Accession Number ______, can be isolated by the polymerase chain reaction (PCR) using
synthetic oligonucleotide primers designed based upon the sequence of SEQ ID NO:1 or 3,
or the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as
Accession Number ______.

A nucleic acid of the invention can be amplified using cDNA, mRNA or
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers
according to standard PCR amplification techniques. The nucleic acid so amplified can be
cloned into an appropriate vector and characterized by DNA sequence analysis.
Furthermore, oligonucleotides corresponding to AS3 nucleotide sequences can be prepared
by standard synthetic techniques, e.g., using an automated DNA synthesizer.

In a preferred embodiment, an isolated nucleic acid molecule of the invention
comprises the nucleotide sequence shown in SEQ ID NO:1. The sequence of SEQ ID NO:1
corresponds to the human AS3 cDNA. This cDNA comprises sequences encoding the
human AS3 protein (i.e., "the coding region", from nucleotides 66-4238), as well as 5'
untranslated sequences (nucleotides 1-65) and 3' untranslated sequences (nucleotides 4239-
5253). Alternatively, the nucleic acid molecule can comprise only the coding region of SEQ
ID NO:1 (e.g., nucleotides 66-4238, corresponding to SEQ ID NO:3).
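
As a consistency check on the coordinates recited above, the following sketch (illustrative only; the constant and function names are hypothetical) treats the nucleotide positions as 1-based, inclusive ranges, which is an assumption about the numbering convention. Under that assumption the coding region spans 4173 nucleotides, i.e., 1391 codons, which agrees with the approximately 1391 amino acid residues stated for the AS3 protein, and the three regions together account for all 5253 nucleotides.

```python
# 1-based, inclusive coordinates as recited in the text for SEQ ID NO:1.
CDNA_LENGTH = 5253
FIVE_PRIME_UTR = (1, 65)
CODING_REGION = (66, 4238)
THREE_PRIME_UTR = (4239, 5253)


def span_length(span: tuple[int, int]) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    start, end = span
    return end - start + 1


def extract(sequence: str, span: tuple[int, int]) -> str:
    """Slice a sequence string using 1-based, inclusive coordinates."""
    start, end = span
    return sequence[start - 1:end]


coding_nt = span_length(CODING_REGION)   # 4173 nucleotides
codons = coding_nt // 3                  # 1391 codons
total = sum(span_length(s) for s in (FIVE_PRIME_UTR, CODING_REGION, THREE_PRIME_UTR))
```

Given the full cDNA as a string, `extract(cdna, CODING_REGION)` would return the portion corresponding to SEQ ID NO:3 under the same numbering assumption.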

In another preferred embodiment, an isolated nucleic acid molecule of the invention
comprises a nucleic acid molecule which is a complement of the nucleotide sequence shown
in SEQ ID NO:1 or 3, or the nucleotide sequence of the DNA insert of the plasmid deposited
with ATCC as Accession Number ______, or a portion of any of these nucleotide sequences.
A nucleic acid molecule which is complementary to the nucleotide sequence shown in SEQ
ID NO:1 or 3, or the nucleotide sequence of the DNA insert of the plasmid deposited with
ATCC as Accession Number ______, is one which is sufficiently complementary to the
nucleotide sequence shown in SEQ ID NO:1 or 3, or the nucleotide sequence of the DNA
insert of the plasmid deposited with ATCC as Accession Number ______, such that it can
hybridize to the nucleotide sequence shown in SEQ ID NO:1 or 3, or the nucleotide
sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number
______, thereby forming a stable duplex.

In still another preferred embodiment, an isolated nucleic acid molecule of the
present invention comprises a nucleotide sequence which is at least about 50%, 55%, 60%,
65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or more homologous to the entire length of the
nucleotide sequence shown in SEQ ID NO:1 or 3, or the entire length of the nucleotide
sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number
______, or a portion of any of these nucleotide sequences.

Moreover, the nucleic acid molecule of the invention can comprise only a portion of
the nucleic acid sequence of SEQ ID NO:1 or 3, or the nucleotide sequence of the DNA
insert of the plasmid deposited with ATCC as Accession Number ______, for example, a
fragment which can be used as a probe or primer or a fragment encoding a portion of an AS3
protein, e.g., a biologically active portion of an AS3 protein. The nucleotide sequence
determined from the cloning of the AS3 gene allows for the generation of probes and
primers designed for use in identifying and/or cloning other AS3 family members, as well as
AS3 homologues from other species. The probe/primer typically comprises a substantially
purified oligonucleotide. The oligonucleotide typically comprises a region of nucleotide
sequence that hybridizes under stringent conditions to at least about 12 or 15, preferably
about 20 or 25, more preferably about 30, 35, 40, 45, 50, 55, 60, 65, or 75 consecutive
nucleotides of a sense sequence of SEQ ID NO:1 or 3, or the nucleotide sequence of the
DNA insert of the plasmid deposited with ATCC as Accession Number ______, of an
antisense sequence of SEQ ID NO:1 or 3, or the nucleotide sequence of the DNA insert of the
plasmid deposited with ATCC as Accession Number ______, or of a naturally occurring
allelic variant or mutant of SEQ ID NO:1 or 3, or the nucleotide sequence of the DNA insert
of the plasmid deposited with ATCC as Accession Number ______. In an exemplary
embodiment, a nucleic acid molecule of the present invention comprises a nucleotide
sequence which is greater than 100-150, 150-200, 200-250, 250-300, 300-350, 350-400,
400-450, 450-500, 500-550, 550-600, 600-650, 650-700, 700-750, 750-800, 800-850,
850-900, 900-950, 950-1000, or more nucleotides in length and hybridizes under stringent
hybridization conditions to a nucleic acid molecule of SEQ ID NO:1 or 3, or the nucleotide
sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number ______.



Probes based on the AS3 nucleotide sequences can be used to detect transcripts or
genomic sequences encoding the same or homologous proteins. In preferred embodiments,
the probe further comprises a label group attached thereto, e.g., the label group can be a
radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can
be used as a part of a diagnostic test kit for identifying cells or tissue which misexpress an
AS3 protein, such as by measuring a level of an AS3-encoding nucleic acid in a sample of
cells from a subject, e.g., detecting AS3 mRNA levels or determining whether a genomic
AS3 gene has been mutated or deleted.

A nucleic acid fragment encoding a "biologically active portion of an AS3 protein"
can be prepared by isolating a portion of the nucleotide sequence of SEQ ID NO:1 or 3, or
the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as
Accession Number ______, which encodes a polypeptide having an AS3 biological activity
(the biological activities of the AS3 proteins are described herein), expressing the encoded
portion of the AS3 protein (e.g., by recombinant expression in vitro) and assessing the
activity of the encoded portion of the AS3 protein.

The invention further encompasses nucleic acid molecules that differ from the
nucleotide sequence shown in SEQ ID NO:1 or 3, or the nucleotide sequence of the DNA
insert of the plasmid deposited with ATCC as Accession Number ______, due to degeneracy
of the genetic code and thus encode the same AS3 proteins as those encoded by the
nucleotide sequence shown in SEQ ID NO:1 or 3, or the nucleotide sequence of the DNA
insert of the plasmid deposited with ATCC as Accession Number ______. In another
embodiment, an isolated nucleic acid molecule of the invention has a nucleotide sequence
encoding a protein having an amino acid sequence shown in SEQ ID NO:2.
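
The effect of degeneracy of the genetic code can be illustrated with a short sketch. The example below is illustrative only: it uses the standard genetic code, the function name `translate` is hypothetical, and the two nine-nucleotide sequences are toy sequences chosen for the illustration, not portions of SEQ ID NO:1 or 3. The two sequences differ at two nucleotide positions yet encode the same three-residue peptide.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(coding_dna: str) -> str:
    """Translate a DNA coding sequence (sense strand) codon by codon; '*' marks a stop."""
    seq = coding_dna.upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Two toy sequences that differ in nucleotide sequence but not in encoded peptide.
variant_1 = "ATGGCTTCA"  # ATG GCT TCA
variant_2 = "ATGGCCAGC"  # ATG GCC AGC
same_protein = translate(variant_1) == translate(variant_2)
```

Both toy sequences translate to Met-Ala-Ser, since GCT and GCC both encode alanine and TCA and AGC both encode serine.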

In addition to the AS3 nucleotide sequences shown in SEQ ID NO:1 or 3, or the
nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as Accession
Number ______, it will be appreciated by those skilled in the art that DNA sequence
polymorphisms that lead to changes in the amino acid sequences of the AS3 proteins may
exist within a population (e.g., the human population). Such genetic polymorphism in the
AS3 genes may exist among individuals within a population due to natural allelic variation.
As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules
which include an open reading frame encoding an AS3 protein, preferably a mammalian
AS3 protein, and can further include non-coding regulatory sequences, and introns.

Allelic variants of human AS3 include both functional and non-functional AS3
proteins. Functional allelic variants are naturally occurring amino acid sequence variants of
the human AS3 protein that maintain the ability to modulate cell proliferation, e.g.,
androgen-induced changes in cell proliferation. Functional allelic variants will typically
contain only conservative substitution of one or more amino acids of SEQ ID NO:2 or
substitution, deletion or insertion of non-critical residues in non-critical regions of the
protein.

Non-functional allelic variants are naturally occurring amino acid sequence variants
of the human AS3 protein that do not have the ability to modulate cell proliferation, for
example, hormone-induced changes in cell proliferation. Non-functional allelic variants will
typically contain a non-conservative substitution, a deletion, or insertion or premature
truncation of the amino acid sequence of SEQ ID NO:2 or a substitution, insertion or
deletion in critical residues or critical regions.

The present invention further provides non-human orthologues of the human AS3
protein. Orthologues of the human AS3 protein are proteins that are isolated from
non-human organisms and possess the same AS3 ability to modulate cell proliferation, for
example, hormone-induced changes in cell proliferation. Orthologues of the human AS3
protein can readily be identified as comprising an amino acid sequence that is substantially
homologous to SEQ ID NO:2.

Moreover, nucleic acid molecules encoding other AS3 family members and, thus,
which have a nucleotide sequence which differs from the AS3 sequences of SEQ ID NO:1
or 3, or the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as
Accession Number ______, are intended to be within the scope of the invention. For example,
another AS3 cDNA can be identified based on the nucleotide sequence of human AS3.
Moreover, nucleic acid molecules encoding AS3 proteins from different species, and which,
thus, have a nucleotide sequence which differs from the AS3 sequences of SEQ ID NO:1 or
3, or the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as
Accession Number ______, are intended to be within the scope of the invention. For example,
a mouse AS3 cDNA can be identified based on the nucleotide sequence of a human AS3.

Nucleic acid molecules corresponding to natural allelic variants and homologues of
the AS3 cDNAs of the invention can be isolated based on their homology to the AS3 nucleic
acids disclosed herein using the cDNAs disclosed herein, or a portion thereof, as a
hybridization probe according to standard hybridization techniques under stringent
hybridization conditions. Nucleic acid molecules corresponding to natural allelic variants
and homologues of the AS3 cDNAs of the invention can further be isolated by mapping to
the same chromosome or locus as the AS3 gene.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 15, 20, 25, 30 or more nucleotides in length and hybridizes under
stringent conditions to the nucleic acid molecule comprising the nucleotide sequence of SEQ
ID NO:1 or 3, or the nucleotide sequence of the DNA insert of the plasmid deposited with
ATCC as Accession Number ______. In another embodiment, the nucleic acid is at least 30,
50, 100, 150, 200, 250, 253, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850,
900, or 950 nucleotides in length. As used herein, the term "hybridizes under stringent
conditions" is intended to describe conditions for hybridization and washing under which
nucleotide sequences at least 60% homologous to each other typically remain hybridized to
each other. Preferably, the conditions are such that sequences at least about 70%, more
preferably at least about 80%, even more preferably at least about 85% or 90% homologous
to each other typically remain hybridized to each other. Such stringent conditions are
known to those skilled in the art and can be found in Current Protocols in Molecular
Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting example
of stringent hybridization conditions is hybridization in 6X sodium chloride/sodium citrate
(SSC) at about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 50°C,
preferably at 55°C, more preferably at 60°C, and even more preferably at 65°C. Preferably,
an isolated nucleic acid molecule of the invention that hybridizes under stringent conditions
to the sequence of SEQ ID NO:1 or 3 corresponds to a naturally-occurring nucleic acid
molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA
or DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a natural
protein).

In addition to naturally-occurring allelic variants of the AS3 sequences that may exist
in the population, the skilled artisan will further appreciate that changes can be introduced by
mutation into the nucleotide sequences of SEQ ID NO:1 or 3, or the nucleotide sequence of
the DNA insert of the plasmid deposited with ATCC as Accession Number ______, thereby
leading to changes in the amino acid sequence of the encoded AS3 proteins, without altering
the functional ability of the AS3 proteins. For example, nucleotide substitutions leading to
amino acid substitutions at "non-essential" amino acid residues can be made in the sequence
of SEQ ID NO:1 or 3, or the nucleotide sequence of the DNA insert of the plasmid deposited
with ATCC as Accession Number ______. A "non-essential" amino acid residue is a residue
that can be altered from the wild-type sequence of AS3 (e.g., the sequence of SEQ ID NO:2)
without altering the biological activity, whereas an "essential" amino acid residue is required
for biological activity. For example, amino acid residues that are conserved among the AS3
proteins of the present invention, e.g., those present in the heptad repeat or kinase domains,
are predicted to be particularly recalcitrant to alteration. Furthermore, additional amino acid
residues that are conserved between the AS3 proteins of the present invention and other
members of the ASIC family are not likely to be amenable to alteration.

Accordingly, another aspect of the invention pertains to nucleic acid molecules
encoding AS3 proteins that contain changes in amino acid residues that are not essential for
activity. Such AS3 proteins differ in amino acid sequence from SEQ ID NO:2, yet retain
biological activity. In one embodiment, the isolated nucleic acid molecule comprises a
nucleotide sequence encoding a protein, wherein the protein comprises an amino acid
sequence at least about 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%
or more homologous to SEQ ID NO:2.

An isolated nucleic acid molecule encoding an AS3 protein homologous to the
protein of SEQ ID NO:2 can be created by introducing one or more nucleotide substitutions,
additions or deletions into the nucleotide sequence of SEQ ID NO:1 or 3, or the nucleotide
sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number
______, such that one or more amino acid substitutions, additions or deletions are introduced
into the encoded protein. Mutations can be introduced into SEQ ID NO:1 or 3, or the
nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as Accession
Number ______, by standard techniques, such as site-directed mutagenesis and PCR-mediated
mutagenesis. Preferably, conservative amino acid substitutions are made at one or more
predicted non-essential amino acid residues. A "conservative amino acid substitution" is one
in which the amino acid residue is replaced with an amino acid residue having a similar side
chain. Families of amino acid residues having similar side chains have been defined in the
art. These families include amino acids with basic side chains (e.g., lysine, arginine,
histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains
(e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side
chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine,
tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side
chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential
amino acid residue in an AS3 protein is preferably replaced with another amino acid residue
from the same side chain family. Alternatively, in another embodiment, mutations can be
introduced randomly along all or part of an AS3 coding sequence, such as by saturation
mutagenesis, and the resultant mutants can be screened for AS3 biological activity to identify
mutants that retain activity. Following mutagenesis of SEQ ID NO:1 or 3, or the nucleotide
sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number
______, the encoded protein can be expressed recombinantly and the activity of the protein
can be determined.
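
The side chain families listed above can be expressed as a simple lookup, as in the following sketch. It is illustrative only: the names `SIDE_CHAIN_FAMILIES` and `is_conservative` are hypothetical, residues are given in one-letter code, and a substitution is treated as conservative when the two residues share at least one of the listed families (an assumption about how overlapping families, e.g., threonine in both the uncharged polar and beta-branched families, should be handled).

```python
# Side chain families as listed in the text, in one-letter amino acid code.
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),                # lysine, arginine, histidine
    "acidic": set("DE"),                # aspartic acid, glutamic acid
    "uncharged polar": set("GNQSTYC"),  # Gly, Asn, Gln, Ser, Thr, Tyr, Cys
    "nonpolar": set("AVLIPFMW"),        # Ala, Val, Leu, Ile, Pro, Phe, Met, Trp
    "beta-branched": set("TVI"),        # threonine, valine, isoleucine
    "aromatic": set("YFWH"),            # Tyr, Phe, Trp, His
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if two different residues belong to at least one common side chain family."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # no substitution has been made
    return any(a in family and b in family for family in SIDE_CHAIN_FAMILIES.values())
```

Under this sketch, replacing lysine with arginine is conservative (both basic), whereas replacing aspartic acid with lysine is not.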

In a preferred embodiment, a mutant AS3 protein can be assayed for the ability to
modulate cell proliferation, for example, hormone-induced changes in cell
proliferation.

In addition to the nucleic acid molecules encoding AS3 proteins described above,
another aspect of the invention pertains to isolated nucleic acid molecules which are
antisense thereto. An "antisense" nucleic acid comprises a nucleotide sequence which is
complementary to a "sense" nucleic acid encoding a protein, e.g., complementary to the
coding strand of a double-stranded cDNA molecule or complementary to an mRNA
sequence. Accordingly, an antisense nucleic acid can hydrogen bond to a sense nucleic acid.
The antisense nucleic acid can be complementary to an entire AS3 coding strand, or to only a
portion thereof. In one embodiment, an antisense nucleic acid molecule is antisense to a
"coding region" of the coding strand of a nucleotide sequence encoding AS3. The term
"coding region" refers to the region of the nucleotide sequence comprising codons which are
translated into amino acid residues (e.g., the coding region of human AS3 corresponds to
SEQ ID NO:3). In another embodiment, the antisense nucleic acid molecule is antisense to a
"noncoding region" of the coding strand of a nucleotide sequence encoding AS3. The term
"noncoding region" refers to 5' and 3' sequences which flank the coding region that are not
translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions).

Given the coding strand sequences encoding AS3 disclosed herein (e.g., SEQ ID
NO:3), antisense nucleic acids of the invention can be designed according to the rules of
Watson and Crick base pairing. The antisense nucleic acid molecule can be complementary
to the entire coding region of AS3 mRNA, but more preferably is an oligonucleotide which is
antisense to only a portion of the coding or noncoding region of AS3 mRNA. For example,
the antisense oligonucleotide can be complementary to the region surrounding the translation
start site of AS3 mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15,
20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention
can be constructed using chemical synthesis and enzymatic ligation reactions using
procedures known in the art. For example, an antisense nucleic acid (e.g., an antisense
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or
variously modified nucleotides designed to increase the biological stability of the molecules
or to increase the physical stability of the duplex formed between the antisense and sense
nucleic acids, e.g., phosphorothioate derivatives and acridine substituted nucleotides can be
used. Examples of modified nucleotides which can be used to generate the antisense nucleic
acid include 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine,
xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil,
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil,
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine,
1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine,
5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil,
5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine,
5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine,
uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine,
5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid
methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil,
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the
antisense nucleic acid can be produced biologically using an expression vector into which a
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest,
described further in the following subsection).
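
The design of an antisense oligonucleotide by Watson and Crick base pairing, as described above, can be sketched as follows. The example is illustrative only: the function names `reverse_complement` and `antisense_oligo`, the parameters `upstream` and `length`, and the 22-nucleotide coding strand are all hypothetical and chosen for the illustration; the toy sequence is not a portion of SEQ ID NO:1 or 3. The sketch selects a window of the coding strand around the translation start site and returns its reverse complement, written 5' to 3'.

```python
# Watson-Crick pairing for DNA: A pairs with T, C pairs with G.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the strand complementary to `dna`, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_oligo(coding_strand: str, start_pos: int, upstream: int = 5, length: int = 20) -> str:
    """Antisense oligonucleotide covering the region around a translation start site.

    start_pos : 1-based position of the A of the ATG start codon in `coding_strand`
    upstream  : number of nucleotides 5' of the start codon to include in the target
    length    : length of the target window (and hence of the oligonucleotide)
    """
    begin = max(start_pos - 1 - upstream, 0)
    target = coding_strand[begin:begin + length]
    return reverse_complement(target)


# Toy coding strand with an ATG at positions 7-9 (hypothetical, for illustration only).
toy_coding_strand = "GGCACCATGGCTTCAGATCCGT"
oligo = antisense_oligo(toy_coding_strand, start_pos=7, upstream=3, length=12)
```

Here the target window is ACCATGGCTTCA and the resulting 12-mer is TGAAGCCATGGT. For SEQ ID NO:1, the text places the start of the coding region at nucleotide 66, so `start_pos=66` would be the corresponding argument under the same 1-based numbering assumption.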

The antisense nucleic acid molecules of the invention are typically administered to a
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or
genomic DNA encoding an AS3 protein to thereby inhibit expression of the protein, e.g., by
inhibiting transcription and/or translation. The hybridization can be by conventional
nucleotide complementarity to form a stable duplex, or, for example, in the case of an
antisense nucleic acid molecule which binds to DNA duplexes, through specific interactions
in the major groove of the double helix. An example of a route of administration of
antisense nucleic acid molecules of the invention includes direct injection at a tissue site.
Alternatively, antisense nucleic acid molecules can be modified to target selected cells and
then administered systemically. For example, for systemic administration, antisense
molecules can be modified such that they specifically bind to receptors or antigens
expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules to
peptides or antibodies which bind to cell surface receptors or antigens. The antisense
nucleic acid molecules can also be delivered to cells using the vectors described herein. To
achieve sufficient intracellular concentrations of the antisense molecules, vector constructs
in which the antisense nucleic acid molecule is placed under the control of a strong pol II or
pol III promoter are preferred.

In yet another embodiment, the antisense nucleic acid molecule of the invention is an
α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific
double-stranded hybrids with complementary RNA in which, contrary to the usual β-units,
the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res. 15:6625-
6641). The antisense nucleic acid molecule can also comprise a 2'-O-methylribonucleotide
(Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue
(Inoue et al. (1987) FEBS Lett. 215:327-330).

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme.
Ribozymes are catalytic RNA molecules with ribonuclease activity which are capable of
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a
complementary region. Thus, ribozymes (e.g., hammerhead ribozymes (described in
Haseloff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave AS3
mRNA transcripts to thereby inhibit translation of AS3 mRNA. A ribozyme having
specificity for an AS3-encoding nucleic acid can be designed based upon the nucleotide
sequence of an AS3 cDNA disclosed herein (i.e., SEQ ID NO:1 or 3, or the nucleotide
sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number
______). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in
which the nucleotide sequence of the active site is complementary to the nucleotide
sequence to be cleaved in an AS3-encoding mRNA. See, e.g., Cech et al. U.S. Patent No.
4,987,071; and Cech et al. U.S. Patent No. 5,116,742. Alternatively, AS3 mRNA can be
used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA
molecules. See, e.g., Bartel, D. and Szostak, J.W. (1993) Science 261:1411-1418.

Alternatively, AS3 gene expression can be inhibited by targeting nucleotide
sequences complementary to the regulatory region of the AS3 gene (e.g., the AS3 promoter
and/or enhancers) to form triple helical structures that prevent transcription of the AS3 gene
in target cells. See generally, Helene, C. (1991) Anticancer Drug Des. 6(6):569-84; Helene,
C. et al. (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher, L.J. (1992) BioEssays
14(12):807-15.

In yet another embodiment, the AS3 nucleic acid molecules of the present invention
can be modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g.,
the stability, hybridization, or solubility of the molecule. For example, the deoxyribose
phosphate backbone of the nucleic acid molecules can be modified to generate peptide
nucleic acids (see Hyrup B. et al. (1996) Bioorganic & Medicinal Chemistry 4(1):5-23).
As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics,
e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA
under conditions of low ionic strength. The synthesis of PNA oligomers can be performed
using standard solid phase peptide synthesis protocols as described in Hyrup B. et al. (1996)
supra; Perry-O'Keefe et al. Proc. Natl. Acad. Sci. 93:14670-675.

PNAs of AS3 nucleic acid molecules can be used in therapeutic and diagnostic
applications. For example, PNAs can be used as antisense or antigene agents for
sequence-specific modulation of gene expression by, for example, inducing transcription or
translation arrest or inhibiting replication. PNAs of AS3 nucleic acid molecules can also be
used in the analysis of single base pair mutations in a gene (e.g., by PNA-directed PCR
clamping); as 'artificial restriction enzymes' when used in combination with other enzymes
(e.g., S1 nucleases (Hyrup B. (1996) supra)); or as probes or primers for DNA sequencing or
hybridization (Hyrup B. et al. (1996) supra; Perry-O'Keefe supra).

In another embodiment, PNAs of AS3 can be modified (e.g., to enhance their
stability or cellular uptake) by attaching lipophilic or other helper groups to PNA, by the
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug
delivery known in the art. For example, PNA-DNA chimeras of AS3 nucleic acid molecules
can be generated which may combine the advantageous properties of PNA and DNA. Such
chimeras allow DNA recognition enzymes (e.g., RNAse H and DNA polymerases) to
interact with the DNA portion while the PNA portion would provide high binding affinity
and specificity. PNA-DNA chimeras can be linked using linkers of appropriate lengths
selected in terms of base stacking, number of bonds between the nucleobases, and
orientation (Hyrup B. (1996) supra). The synthesis of PNA-DNA chimeras can be
performed as described in Hyrup B. (1996) supra and Finn P.J. et al. (1996) Nucleic Acids
Res. 24 (17): 3357-63. For example, a DNA chain can be synthesized on a solid support
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g.,
5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used as a linker
between the PNA and the 5' end of DNA (Mag, M. et al. (1989) Nucleic Acid Res. 17: 5973-88).
PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule with
a 5' PNA segment and a 3' DNA segment (Finn P.J. et al. (1996) supra). Alternatively,
chimeric molecules can be synthesized with a 5' DNA segment and a 3' PNA segment
(Petersen, K.H. et al. (1975) Bioorganic Med. Chem. Lett. 5:1119-11124).

In other embodiments, the oligonucleotide may include other appended groups such
as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport
across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. USA
86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad. Sci. USA 84:648-652; PCT
Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT Publication No.
WO 89/10134). In addition, oligonucleotides can be modified with hybridization-triggered
cleavage agents (see, e.g., Krol et al. (1988) BioTechniques 6:958-976) or intercalating
agents (see, e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the oligonucleotide
may be conjugated to another molecule (e.g., a peptide, hybridization-triggered
cross-linking agent, transport agent, or hybridization-triggered cleavage agent).

II. Isolated AS3 Proteins

One aspect of the invention pertains to isolated AS3 proteins, and biologically active
portions thereof, as well as polypeptide fragments suitable for use as immunogens to raise
anti-AS3 antibodies. In one embodiment, native AS3 proteins can be isolated from cells or
tissue sources by an appropriate purification scheme using standard protein purification
techniques. In another embodiment, AS3 proteins are produced by recombinant DNA
techniques. Alternative to recombinant expression, an AS3 protein or polypeptide can be
synthesized chemically using standard peptide synthesis techniques.

An "isolated" or "purified" protein or biologically active portion thereof is
substantially free of cellular material or other contaminating proteins from the cell or tissue
source from which the AS3 protein is derived, or substantially free from chemical precursors
or other chemicals when chemically synthesized. The language "substantially free of
cellular material" includes preparations of AS3 protein in which the protein is separated
from cellular components of the cells from which it is isolated or recombinantly produced.
In one embodiment, the language "substantially free of cellular material" includes
preparations of AS3 protein having less than about 30% (by dry weight) of non-AS3 protein
(also referred to herein as a "contaminating protein"), more preferably less than about 20%
of non-AS3 protein, still more preferably less than about 10% of non-AS3 protein, and most
preferably less than about 5% non-AS3 protein. When the AS3 protein or biologically
active portion thereof is recombinantly produced, it is also preferably substantially free of
culture medium, i.e., culture medium represents less than about 20%, more preferably less
than about 10%, and most preferably less than about 5% of the volume of the protein
preparation.

The language "substantially free of chemical precursors or other chemicals" includes
preparations of AS3 protein in which the protein is separated from chemical precursors or
other chemicals which are involved in the synthesis of the protein. In one embodiment, the
language "substantially free of chemical precursors or other chemicals" includes
preparations of AS3 protein having less than about 30% (by dry weight) of chemical
precursors or non-AS3 chemicals, more preferably less than about 20% chemical precursors
or non-AS3 chemicals, still more preferably less than about 10% chemical precursors or
non-AS3 chemicals, and most preferably less than about 5% chemical precursors or
non-AS3 chemicals.

As used herein, a "biologically active portion" of an AS3 protein includes a fragment
of an AS3 protein which participates in an interaction between an AS3 molecule and a
non-AS3 molecule. Biologically active portions of an AS3 protein include peptides comprising
amino acid sequences sufficiently homologous to or derived from the amino acid sequence
of the AS3 protein, e.g., the amino acid sequence shown in SEQ ID NO:2, which include
fewer amino acids than the full length AS3 proteins, and exhibit at least one activity of an AS3
protein. Typically, biologically active portions comprise a domain or motif with at least one
activity of the AS3 protein, e.g., the ability to modulate cell proliferation. A biologically
active portion of an AS3 protein can be a polypeptide which is, for example, 10, 25, 50, 100,
200 or more amino acids in length. Biologically active portions of an AS3 protein can be
used as targets for developing agents which modulate an AS3-mediated activity, e.g., cell
proliferation.

In one embodiment, a biologically active portion of an AS3 protein comprises at
least one heptad repeat. It is to be understood that a preferred biologically active portion of
an AS3 protein of the present invention may contain two, three, four, or five heptad repeats.
As used herein, the term "heptad repeat" includes a protein domain which contains either a
leucine or similar hydrophobic residue (e.g., isoleucine, valine, or tyrosine) in every seventh
position over a stretch of at least 20-30 amino acid residues, more preferably at least 20-25
amino acid residues, and most preferably at least 22 amino acid residues. Typically, the
heptad repeat is uninterrupted, and can participate in protein-protein interactions. The
leucine-zipper motif of DNA binding proteins is a specific subclass of this general pattern
and is encompassed by the above term. Leucine zipper domains are described in, for
example, Landschulz et al. (1988) Science 240:1759-1764, the contents of which are
incorporated herein by reference. Amino acid residues 55-161, 196-217, 241-277, 319-355,
and 375-404 of the AS3 protein all comprise heptad repeats.
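
By way of illustration only, the heptad repeat definition given above can be expressed as a
simple sequence scan. The following Python sketch is not part of the disclosed methods; the
function name, the residue set, and the 22-residue default are assumptions drawn from the
wording of the definition, and the input is expected to be a one-letter amino acid string.

```python
# Illustrative sketch only; names and defaults are assumptions, not part of the disclosure.
HYDROPHOBIC = set("LIVY")  # leucine, isoleucine, valine, tyrosine, per the definition above


def find_heptad_repeats(seq, min_span=22):
    """Return 1-based inclusive (start, end) spans in which a hydrophobic residue
    recurs at every seventh position across at least `min_span` residues."""
    seq = seq.upper()
    spans = []
    seen = set()  # positions already assigned to an earlier chain
    for start in range(len(seq)):
        if start in seen or seq[start] not in HYDROPHOBIC:
            continue
        pos = start
        chain = [pos]
        # Extend the chain while the residue seven positions downstream is hydrophobic.
        while pos + 7 < len(seq) and seq[pos + 7] in HYDROPHOBIC:
            pos += 7
            chain.append(pos)
        seen.update(chain)
        if pos - start + 1 >= min_span:
            spans.append((start + 1, pos + 1))
    return spans
```

A toy sequence such as "LAAAAAA" repeated four times carries leucine at positions 1, 8, 15
and 22 and therefore yields a single 22-residue span, whereas three repeats fall short of the
minimum. A scan of this kind would only flag candidate regions; it does not establish that a
region participates in protein-protein interactions.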

In another embodiment, a biologically active portion of an AS3 protein comprises at
least one kinase-related domain. It is understood that a preferred biologically active portion
of an AS3 protein of the present invention may contain two, three, four, five, six, seven,
eight, or nine kinase-related domains. As used herein, the term "kinase-related domain"
includes a polypeptide consensus motif having high homology to a known protein kinase
catalytic domain as described in Hanks et al. (1988) Meth. Enzymol. 200:38-62 (the
contents of which are incorporated herein by reference). In particular, a kinase-related
domain is any one of the nine consensus motifs or related sequences set forth in Fig. 3.

In another embodiment, a biologically active portion of an AS3 protein can have
kinase activity. As referred to herein, "kinase activity" is an activity associated with a
protein or polypeptide which is capable of modulating its own phosphorylation state or the
phosphorylation state of another protein or polypeptide. Proteins with kinase activity play a
role in signaling pathways associated with cellular growth.

Moreover, other biologically active portions, in which other regions of the protein are
deleted, can be prepared by recombinant techniques and evaluated for one or more of the
functional activities of a native AS3 protein.

In a preferred embodiment, the AS3 protein has an amino acid sequence shown in
SEQ ID NO:2. In other embodiments, the AS3 protein is substantially homologous to SEQ
ID NO:2, and retains the functional activity of the protein of SEQ ID NO:2, yet differs in
amino acid sequence due to natural allelic variation or mutagenesis, as described in detail in
subsection I above. Accordingly, in another embodiment, the AS3 protein is a protein
which comprises an amino acid sequence at least about 45%, 50%, 55%, 60%, 65%, 70%,
75%, 80%, 85%, 90%, 95%, 98% or more homologous to SEQ ID NO:2.

To determine the percent identity of two amino acid sequences or of two nucleic acid
sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be
introduced in one or both of a first and a second amino acid or nucleic acid sequence for
optimal alignment and non-homologous sequences can be disregarded for comparison
purposes). In a preferred embodiment, the length of a reference sequence aligned for
comparison purposes is at least 30%, preferably at least 40%, more preferably at least 50%,
even more preferably at least 60%, and even more preferably at least 70%, 80%, or 90% of
the length of the reference sequence (e.g., when aligning a second sequence to the AS3
amino acid sequence of SEQ ID NO:2 having 400 amino acid residues, at least 80,
preferably at least 100, more preferably at least 120, even more preferably at least 140, and
even more preferably at least 150, 200, 300, or 400 amino acid residues are aligned). The
amino acid residues or nucleotides at corresponding amino acid positions or nucleotide
positions are then compared. When a position in the first sequence is occupied by the same
amino acid residue or nucleotide as the corresponding position in the second sequence, then
the molecules are identical at that position (as used herein amino acid or nucleic acid
"identity" is equivalent to amino acid or nucleic acid "homology"). The percent identity
between the two sequences is a function of the number of identical positions shared by the
sequences, taking into account the number of gaps, and the length of each gap, which need
to be introduced for optimal alignment of the two sequences.
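
For illustration, the position-by-position comparison described above can be sketched in
Python once two sequences have already been aligned. This is a minimal sketch under stated
assumptions: the function name and the "-" gap character are hypothetical, and the choice to
count every gapped column as a non-identical position is only one of several conventions
consistent with "taking into account the number of gaps".

```python
# Illustrative sketch only; the denominator convention is an assumption.
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over the columns of a pairwise alignment.

    Both inputs must be the same length (already aligned). A column counts as
    identical only when both sequences carry the same non-gap residue; columns
    containing a gap in one sequence count against identity."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    identical = 0
    compared = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue  # a column of two gaps carries no information
        compared += 1
        if a == b:
            identical += 1
    return 100.0 * identical / compared if compared else 0.0
```

For example, aligning "ACDEFGHK" with "ACDE-GHR" gives eight columns, of which six are
identical, so the function reports 75 percent identity under this convention.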

The comparison of sequences and determination of percent identity between two
sequences can be accomplished using a mathematical algorithm. In a preferred embodiment,
the percent identity between two amino acid sequences is determined using the Needleman
and Wunsch (J. Mol. Biol. (48):444-453 (1970)) algorithm which has been incorporated into
the GAP program in the GCG software package (available at http://www.gcg.com), using
either a Blosum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or
4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent
identity between two nucleotide sequences is determined using the GAP program in the
GCG software package (available at http://www.gcg.com), using a NWSgapdna.CMP
matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In
another embodiment, the percent identity between two amino acid or nucleotide sequences is
determined using the algorithm of E. Meyers and W. Miller (Comput. Appl. Biosci., 4:11-17
(1988)) which has been incorporated into the ALIGN program (version 2.0), using a
PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4.
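
The global alignment approach of Needleman and Wunsch referred to above can be
illustrated with a short dynamic-programming sketch. The Python code below is a deliberate
simplification and is not the GAP program: it assumes a flat match/mismatch score in place of
a Blosum 62 or PAM250 matrix, and a single linear gap penalty in place of separate gap and
length weights. All names and default values are hypothetical.

```python
# Illustrative sketch only; scoring scheme is a simplification of the programs named above.
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment of strings a and b.

    Returns (score, aligned_a, aligned_b), with '-' marking gap positions."""
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right cell to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))
```

With the default scores, aligning "GATTACA" against "GATACA" introduces one gap into the
shorter sequence and scores six matches less one gap penalty. The aligned strings returned
here could then be passed to a percent identity calculation of the kind described above.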

The nucleic acid and protein sequences of the present invention can further be used
as a "query sequence" to perform a search against public databases to, for example, identify
other family members or related sequences. Such searches can be performed using the
NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol.
215:403-10. BLAST nucleotide searches can be performed with the NBLAST program,
score = 100, wordlength = 12 to obtain nucleotide sequences homologous to AS3 nucleic
acid molecules of the invention. BLAST protein searches can be performed with the
XBLAST program, score = 50, wordlength = 3 to obtain amino acid sequences homologous
to AS3 protein molecules of the invention. To obtain gapped alignments for comparison
purposes, Gapped BLAST can be utilized as described in Altschul et al., (1997) Nucleic
Acids Res. 25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, the
default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used.
See http://www.ncbi.nlm.nih.gov.

The invention also provides AS3 chimeric or fusion proteins. As used herein, an 
AS3 "chimeric protein" or "fusion protein" comprises an AS3 polypeptide operatively linked 
to a non-AS3 polypeptide. An "AS3 polypeptide" refers to a polypeptide having an amino
acid sequence corresponding to AS3, whereas a "non-AS3 polypeptide" refers to a 
polypeptide having an amino acid sequence corresponding to a protein which is not 
substantially homologous to the AS3 protein, e.g., a protein which is different from the AS3 
protein and which is derived from the same or a different organism. Within an AS3 fusion 
protein the AS3 polypeptide can correspond to all or a portion of an AS3 protein. In a 
preferred embodiment, an AS3 fusion protein comprises at least one biologically active 
portion of an AS3 protein. In another preferred embodiment, an AS3 fusion protein 
comprises at least two biologically active portions of an AS3 protein. Within the fusion 
protein, the term "operatively linked" is intended to indicate that the AS3 polypeptide and 
the non-AS3 polypeptide are fused in-frame to each other. The non-AS3 polypeptide can be 
fused to the N-terminus or C-terminus of the AS3 polypeptide. 

For example, in one embodiment, the fusion protein is a GST-AS3 fusion protein in 
which the AS3 sequences are fused to the C-terminus of the GST sequences. Such fusion 
proteins can facilitate the purification of recombinant AS3. 

In another embodiment, the fusion protein is an AS3 protein containing a
heterologous signal sequence at its N-terminus. In certain host cells (e.g., mammalian host
cells), expression and/or secretion of AS3 can be increased through use of a heterologous
signal sequence.

The AS3 fusion proteins of the invention can be incorporated into pharmaceutical
compositions and administered to a subject in vivo. The AS3 fusion proteins can be used to
affect the bioavailability of an AS3 substrate. Use of AS3 fusion proteins may be useful
therapeutically for the treatment of disorders caused by, for example, (i) aberrant
modification or mutation of a gene encoding an AS3 protein; (ii) mis-regulation of the AS3
gene; and (iii) aberrant post-translational modification of an AS3 protein.

Moreover, the AS3 fusion proteins of the invention can be used as immunogens to
produce anti-AS3 antibodies in a subject, to purify AS3 ligands and in screening assays to
identify molecules which inhibit the interaction of AS3 with an AS3 substrate.

Preferably, an AS3 chimeric or fusion protein of the invention is produced by
standard recombinant DNA techniques. For example, DNA fragments coding for the
different polypeptide sequences are ligated together in-frame in accordance with
conventional techniques, for example by employing blunt-ended or stagger-ended termini
for ligation, restriction enzyme digestion to provide for appropriate termini, filling-in of
cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining,
and enzymatic ligation. In another embodiment, the fusion gene can be synthesized by
conventional techniques including automated DNA synthesizers. Alternatively, PCR
amplification of gene fragments can be carried out using anchor primers which give rise to
complementary overhangs between two consecutive gene fragments which can subsequently
be annealed and reamplified to generate a chimeric gene sequence (see, for example,
Current Protocols in Molecular Biology, eds. Ausubel et al. John Wiley & Sons: 1992).
Moreover, many expression vectors are commercially available that already encode a fusion
moiety (e.g., a GST polypeptide). An AS3-encoding nucleic acid can be cloned into such an
expression vector such that the fusion moiety is linked in-frame to the AS3 protein.

The present invention also pertains to variants of the AS3 proteins which function as
either AS3 agonists (mimetics) or as AS3 antagonists. Variants of the AS3 proteins can be
generated by mutagenesis, e.g., discrete point mutation or truncation of an AS3 protein. An
agonist of the AS3 proteins can retain substantially the same, or a subset, of the biological
activities of the naturally occurring form of an AS3 protein. An antagonist of an AS3
protein can inhibit one or more of the activities of the naturally occurring form of the AS3
protein by, for example, competitively modulating an AS3-mediated activity of an AS3
protein. Thus, specific biological effects can be elicited by treatment with a variant of
limited function. In one embodiment, treatment of a subject with a variant having a subset
of the biological activities of the naturally occurring form of the protein has fewer side
effects in a subject relative to treatment with the naturally occurring form of the AS3
protein.

In one embodiment, variants of an AS3 protein which function as either AS3
agonists (mimetics) or as AS3 antagonists can be identified by screening combinatorial
libraries of mutants, e.g., truncation mutants, of an AS3 protein for AS3 protein agonist or
antagonist activity. In one embodiment, a variegated library of AS3 variants is generated by
combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene
library. A variegated library of AS3 variants can be produced by, for example,
enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences such that
a degenerate set of potential AS3 sequences is expressible as individual polypeptides, or
alternatively, as a set of larger fusion proteins (e.g., for phage display) containing the set of
AS3 sequences therein. There are a variety of methods which can be used to produce
libraries of potential AS3 variants from a degenerate oligonucleotide sequence. Chemical
synthesis of a degenerate gene sequence can be performed in an automatic DNA synthesizer,
and the synthetic gene then ligated into an appropriate expression vector. Use of a
degenerate set of genes allows for the provision, in one mixture, of all of the sequences
encoding the desired set of potential AS3 sequences. Methods for synthesizing degenerate
oligonucleotides are known in the art (see, e.g., Narang, S.A. (1983) Tetrahedron 39:3;
Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike
et al. (1983) Nucleic Acid Res. 11:477).
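
To illustrate how a single degenerate oligonucleotide represents, in one mixture, a defined
set of sequences, the following Python sketch enumerates the individual sequences encoded by
a degenerate string written in the standard IUPAC nucleotide ambiguity codes. The function
name and the example oligonucleotides are hypothetical and are not taken from the AS3
sequences disclosed herein.

```python
# Illustrative sketch only; example inputs are hypothetical.
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(oligo):
    """List every unambiguous sequence encoded by a degenerate oligonucleotide."""
    choices = [IUPAC[base] for base in oligo.upper()]
    return ["".join(combo) for combo in product(*choices)]
```

For instance, the degenerate triplet "ARY" expands to AAC, AAT, AGC and AGT, and an "NNK"
codon expands to 32 triplets. Because the number of sequences grows multiplicatively with each
degenerate position, enumeration of this kind is practical only for short oligonucleotides.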

In addition, libraries of fragments of an AS3 protein coding sequence can be used to
generate a variegated population of AS3 fragments for screening and subsequent selection of
variants of an AS3 protein. In one embodiment, a library of coding sequence fragments can
be generated by treating a double stranded PCR fragment of an AS3 coding sequence with a
nuclease under conditions wherein nicking occurs only about once per molecule, denaturing
the double stranded DNA, renaturing the DNA to form double stranded DNA which can
include sense/antisense pairs from different nicked products, removing single stranded
portions from reformed duplexes by treatment with S1 nuclease, and ligating the resulting
fragment library into an expression vector. By this method, an expression library can be
derived which encodes N-terminal, C-terminal and internal fragments of various sizes of the
AS3 protein.

Several techniques are known in the art for screening gene products of combinatorial
libraries made by point mutations or truncation, and for screening cDNA libraries for gene
products having a selected property. Such techniques are adaptable for rapid screening of
the gene libraries generated by the combinatorial mutagenesis of AS3 proteins. The most
widely used techniques, which are amenable to high through-put analysis, for screening
large gene libraries typically include cloning the gene library into replicable expression
vectors, transforming appropriate cells with the resulting library of vectors, and expressing
the combinatorial genes under conditions in which detection of a desired activity facilitates
isolation of the vector encoding the gene whose product was detected. Recursive ensemble
mutagenesis (REM), a new technique which enhances the frequency of functional mutants in
the libraries, can be used in combination with the screening assays to identify AS3 variants
(Arkin and Youvan (1992) Proc. Natl. Acad. Sci. USA 89:7811-7815; Delagrave et al.
(1993) Protein Engineering 6(3):327-331).

In one embodiment, cell based assays can be exploited to analyze a variegated AS3
library. For example, a library of expression vectors can be transfected into a cell line, e.g.,
a neuronal cell line, which ordinarily responds to a particular ligand in an AS3-dependent
manner. The transfected cells are then contacted with the ligand and the effect of expression
of the mutant on signaling by the ligand can be detected, e.g., by measuring intracellular
calcium, potassium, or sodium concentration, neuronal membrane depolarization, or the
activity of an AS3-regulated transcription factor. Plasmid DNA can then be recovered from
the cells which score for inhibition, or alternatively, potentiation of signaling by the ligand,
and the individual clones further characterized.

III. Anti-AS3 Antibodies

An isolated AS3 protein, or a portion or fragment thereof, can be used as an
immunogen to generate antibodies that bind AS3 using standard techniques for polyclonal
and monoclonal antibody preparation. A full-length AS3 protein can be used or,
alternatively, the invention provides antigenic peptide fragments of AS3 for use as
immunogens. The antigenic peptide of AS3 comprises at least 8 amino acid residues of the
amino acid sequence shown in SEQ ID NO:2 and encompasses an epitope of AS3 such that
an antibody raised against the peptide forms a specific immune complex with AS3.
Preferably, the antigenic peptide comprises at least 10 amino acid residues, more preferably
at least 15 amino acid residues, even more preferably at least 20 amino acid residues, and
most preferably at least 30 amino acid residues.

Preferred epitopes encompassed by the antigenic peptide are regions of AS3 that are
located on the surface of the protein, e.g., hydrophilic regions, as well as regions with high
antigenicity (see, for example, Figure 2).
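
One common way to locate candidate hydrophilic regions in a protein sequence is a
sliding-window hydropathy average. The Python sketch below is offered only as an
illustration and is not asserted to be the analysis underlying Figure 2: it assumes the
Kyte-Doolittle hydropathy scale, a seven-residue window, and an arbitrary threshold, and the
function name is hypothetical.

```python
# Illustrative sketch only; scale, window and threshold are assumptions.
# Kyte-Doolittle hydropathy values (more negative = more hydrophilic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydrophilic_windows(seq, window=7, threshold=-1.5):
    """Return (start, mean_hydropathy) for each window whose mean hydropathy is
    at or below `threshold`; `start` is the 1-based position of the window."""
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - window + 1):
        mean = sum(KYTE_DOOLITTLE[aa] for aa in seq[i:i + window]) / window
        if mean <= threshold:
            hits.append((i + 1, round(mean, 2)))
    return hits
```

Windows flagged in this way would be candidates only; surface exposure and antigenicity of
any peptide would still need to be confirmed experimentally.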

An AS3 immunogen typically is used to prepare antibodies by immunizing a suitable
subject (e.g., rabbit, goat, mouse or other mammal) with the immunogen. An appropriate
immunogenic preparation can contain, for example, recombinantly expressed AS3 protein or
a chemically synthesized AS3 polypeptide. The preparation can further include an adjuvant,
such as Freund's complete or incomplete adjuvant, or similar immunostimulatory agent.
Immunization of a suitable subject with an immunogenic AS3 preparation induces a
polyclonal anti-AS3 antibody response.

Accordingly, another aspect of the invention pertains to anti-AS3 antibodies. The
term "antibody" as used herein refers to immunoglobulin molecules and immunologically
active portions of immunoglobulin molecules, i.e., molecules that contain an antigen binding
site which specifically binds (immunoreacts with) an antigen, such as AS3. Examples of
immunologically active portions of immunoglobulin molecules include F(ab) and F(ab')2
fragments which can be generated by treating the antibody with an enzyme such as pepsin.
The invention provides polyclonal and monoclonal antibodies that bind AS3. The term
"monoclonal antibody" or "monoclonal antibody composition", as used herein, refers to a
population of antibody molecules that contain only one species of an antigen binding site
capable of immunoreacting with a particular epitope of AS3. A monoclonal antibody
composition thus typically displays a single binding affinity for a particular AS3 protein
with which it immunoreacts.

Polyclonal anti-AS3 antibodies can be prepared as described above by immunizing a
suitable subject with an AS3 immunogen. The anti-AS3 antibody titer in the immunized
subject can be monitored over time by standard techniques, such as with an enzyme linked
immunosorbent assay (ELISA) using immobilized AS3. If desired, the antibody molecules
directed against AS3 can be isolated from the mammal (e.g., from the blood) and further
purified by well known techniques, such as protein A chromatography to obtain the IgG
fraction. At an appropriate time after immunization, e.g., when the anti-AS3 antibody titers
are highest, antibody-producing cells can be obtained from the subject and used to prepare
monoclonal antibodies by standard techniques, such as the hybridoma technique originally
described by Kohler and Milstein ((1975) Nature 256:495-497; see also, Brown et al. (1981)
J. Immunol. 127:539-46; Brown et al. (1980) J. Biol. Chem. 255:4980-83; Yeh et al. (1976)
Proc. Natl. Acad. Sci. USA 76:2927-31; and Yeh et al. (1982) Int. J. Cancer 29:269-75), the
more recent human B cell hybridoma technique (Kozbor et al. (1983) Immunol. Today 4:72),
the EBV-hybridoma technique (Cole et al. (1985), Monoclonal Antibodies and Cancer
Therapy, Alan R. Liss, Inc., pp. 77-96) or trioma techniques. The technology for producing
monoclonal antibody hybridomas is well known (see generally R. H. Kenneth, in
Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing
Corp., New York, New York (1980); E. A. Lerner (1981) Yale J. Biol. Med. 54:387-402; M.
L. Gefter et al. (1977) Somatic Cell Genet. 3:231-36). Briefly, an immortal cell line
(typically a myeloma) is fused to lymphocytes (typically splenocytes) from a mammal
immunized with an AS3 immunogen as described above, and the culture supernatants of the
resulting hybridoma cells are screened to identify a hybridoma producing a monoclonal
antibody that binds AS3.

Any of the many well known protocols used for fusing lymphocytes and
immortalized cell lines can be applied for the purpose of generating an anti-AS3 monoclonal
antibody (see, e.g., G. Galfre et al. (1977) Nature 266:550-52; Gefter et al. Somatic Cell
Genet., cited supra; Lerner, Yale J. Biol. Med., cited supra; Kenneth, Monoclonal
Antibodies, cited supra). Moreover, the ordinarily skilled worker will appreciate that there
are many variations of such methods which also would be useful. Typically, the immortal
cell line (e.g., a myeloma cell line) is derived from the same mammalian species as the
lymphocytes. For example, murine hybridomas can be made by fusing lymphocytes from a
mouse immunized with an immunogenic preparation of the present invention with an
immortalized mouse cell line. Preferred immortal cell lines are mouse myeloma cell lines
that are sensitive to culture medium containing hypoxanthine, aminopterin and thymidine
("HAT medium"). Any of a number of myeloma cell lines can be used as a fusion partner
according to standard techniques, e.g., the P3-NS1/1-Ag4-1, P3-x63-Ag8.653 or
Sp2/0-Ag14 myeloma lines. These myeloma lines are available from ATCC. Typically,
HAT-sensitive mouse myeloma cells are fused to mouse splenocytes using polyethylene glycol
("PEG"). Hybridoma cells resulting from the fusion are then selected using HAT medium,
which kills unfused and unproductively fused myeloma cells (unfused splenocytes die after
several days because they are not transformed). Hybridoma cells producing a monoclonal
antibody of the invention are detected by screening the hybridoma culture supernatants for
antibodies that bind AS3, e.g., using a standard ELISA assay.

Alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal
anti-AS3 antibody can be identified and isolated by screening a recombinant combinatorial
immunoglobulin library (e.g., an antibody phage display library) with AS3 to thereby isolate
immunoglobulin library members that bind AS3. Kits for generating and screening phage
display libraries are commercially available (e.g., the Pharmacia Recombinant Phage
Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP™ Phage Display
Kit, Catalog No. 240612). Additionally, examples of methods and reagents particularly
amenable for use in generating and screening antibody display libraries can be found in, for
example, Ladner et al. U.S. Patent No. 5,223,409; Kang et al. PCT International Publication
No. WO 92/18619; Dower et al. PCT International Publication No. WO 91/17271; Winter et
al. PCT International Publication WO 92/20791; Markland et al. PCT International
Publication No. WO 92/15679; Breitling et al. PCT International Publication WO 93/01288;
McCafferty et al. PCT International Publication No. WO 92/01047; Garrard et al. PCT
International Publication No. WO 92/09690; Ladner et al. PCT International Publication No.
WO 90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum.
Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-1281; Griffiths et al.
(1993) EMBO J. 12:725-734; Hawkins et al. (1992) J. Mol. Biol. 226:889-896; Clarkson et
al. (1991) Nature 352:624-628; Gram et al. (1992) Proc. Natl. Acad. Sci. USA 89:3576-3580;
Garrard et al. (1991) Bio/Technology 9:1373-1377; Hoogenboom et al. (1991) Nuc.
Acid Res. 19:4133-4137; Barbas et al. (1991) Proc. Natl. Acad. Sci. USA 88:7978-7982; and
McCafferty et al. Nature (1990) 348:552-554.

Additionally, recombinant anti-AS3 antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be 
made using standard recombinant DNA techniques, are within the scope of the invention. 
Such chimeric and humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in Robinson et al. 
International Application No. PCT/US86/02269; Akira et al. European Patent Application 
184,187; Taniguchi, M., European Patent Application 171,496; Morrison et al. European 
Patent Application 173,494; Neuberger et al. PCT International Publication No. WO 
86/01533; Cabilly et al. U.S. Patent No. 4,816,567; Cabilly et al. European Patent 
Application 125,023; Better et al. (1988) Science 240:1041-1043; Liu et al. (1987) Proc. 
Natl. Acad. Sci. USA 84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521-3526; Sun et al. 
(1987) Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura et al. (1987) Canc. Res. 47:999-1005; 
Wood et al. (1985) Nature 314:446-449; Shaw et al. (1988) J. Natl. Cancer Inst. 
80:1553-1559; Morrison, S. L. (1985) Science 229:1202-1207; Oi et al. (1986) 
BioTechniques 4:214; Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature 321:522-525; 
Verhoeyen et al. (1988) Science 239:1534; and Beidler et al. (1988) J. Immunol. 
141:4053-4060. 

An anti-AS3 antibody (e.g., monoclonal antibody) can be used to isolate AS3 by 
standard techniques, such as affinity chromatography or immunoprecipitation. An anti-AS3 
antibody can facilitate the purification of natural AS3 from cells and of recombinantly 
produced AS3 expressed in host cells. Moreover, an anti-AS3 antibody can be used to 
detect AS3 protein (e.g., in a cellular lysate or cell supernatant) in order to evaluate the 
abundance and pattern of expression of the AS3 protein. Anti-AS3 antibodies can be used 
diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, e.g., 
to determine the efficacy of a given treatment regimen. Detection can be 
facilitated by coupling (i.e., physically linking) the antibody to a detectable substance. 
Examples of detectable substances include various enzymes, prosthetic groups, fluorescent 
materials, luminescent materials, bioluminescent materials, and radioactive materials. 
Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, 
β-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes 
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials 
include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin, and aequorin; and examples of suitable radioactive material include 
¹²⁵I, ¹³¹I, ³⁵S or ³H. 

IV. Recombinant Expression Vectors 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding an AS3 protein (or a portion thereof). As used herein, 
the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic 
acid to which it has been linked. One type of vector is a "plasmid", which refers to a 
circular double-stranded DNA loop into which additional DNA segments can be ligated. 
Another type of vector is a viral vector, wherein additional DNA segments can be ligated 
into the viral genome. Certain vectors are capable of autonomous replication in a host cell 
into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication 
and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) 
are integrated into the genome of a host cell upon introduction into the host cell, and thereby 
are replicated along with the host genome. Moreover, certain vectors are capable of 
directing the expression of genes to which they are operatively linked. Such vectors are 
referred to herein as "expression vectors". In general, expression vectors of utility in 
recombinant DNA techniques are often in the form of plasmids. In the present specification, 
"plasmid" and "vector" can be used interchangeably as the plasmid is the most commonly 
used form of vector. However, the invention is intended to include such other forms of 
expression vectors, such as viral vectors (e.g., replication defective retroviruses, 
adenoviruses and adeno-associated viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means 
that the recombinant expression vectors include one or more regulatory sequences, selected 
on the basis of the host cells to be used for expression, which are operatively linked to the 
nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably 
linked" is intended to mean that the nucleotide sequence of interest is linked to the 
regulatory sequence(s) in a manner which allows for expression of the nucleotide sequence 
(e.g., in an in vitro transcription/translation system or in a host cell when the vector is 
introduced into the host cell). The term "regulatory sequence" is intended to include 
promoters, enhancers and other expression control elements (e.g., polyadenylation signals). 
Such regulatory sequences are described, for example, in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990). 
Regulatory sequences include those which direct constitutive expression of a nucleotide 
sequence in many types of host cells and those which direct expression of the nucleotide 
sequence only in certain host cells (e.g., tissue-specific regulatory sequences). It will be 
appreciated by those skilled in the art that the design of the expression vector can depend on 
such factors as the choice of the host cell to be transformed, the level of expression of 
protein desired, and the like. The expression vectors of the invention can be introduced into 
host cells to thereby produce proteins or peptides, including fusion proteins or peptides, 
encoded by nucleic acids as described herein (e.g., AS3 proteins, mutant forms of AS3 
proteins, fusion proteins, and the like). 

The recombinant expression vectors of the invention can be designed for expression 
of AS3 proteins in prokaryotic or eukaryotic cells. For example, AS3 proteins can be 
expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression 
vectors), yeast cells or mammalian cells. Suitable host cells are discussed further in 
Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, CA (1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. coli with vectors 
containing constitutive or inducible promoters directing the expression of either fusion or 
non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded 
therein, usually to the amino terminus of the recombinant protein. Such fusion vectors 
typically serve three purposes: 1) to increase expression of recombinant protein; 2) to 
increase the solubility of the recombinant protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion protein. Suitable proteolytic enzymes, 
and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Typical 
fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D.B. and Johnson, 
K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose E binding 
protein, or protein A, respectively, to the target recombinant protein. 
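
By way of illustration only, the separation of a recombinant protein from its fusion moiety at an engineered protease site can be sketched in Python as follows. The recognition motifs used (IEGR for Factor Xa, DDDDK for enterokinase, LVPRGS for thrombin) are the commonly cited consensus sites and are stated here as assumptions; the function name and the tag and target sequences are hypothetical placeholders and are not sequences of the invention.

```python
# Consensus recognition motifs (assumed, commonly cited) and the number of
# motif residues that remain on the N-terminal (fusion moiety) side of the cut.
CLEAVAGE_SITES = {
    "factor_xa": ("IEGR", 4),      # cuts after Arg
    "enterokinase": ("DDDDK", 5),  # cuts after Lys
    "thrombin": ("LVPRGS", 4),     # cuts between Arg and Gly
}


def cleave_fusion(fusion_seq, protease):
    """Split a fusion protein at the first recognition site of `protease`.

    Returns (n_terminal_fragment, c_terminal_fragment).
    Raises ValueError if no recognition site is present.
    """
    motif, offset = CLEAVAGE_SITES[protease]
    start = fusion_seq.find(motif)
    if start == -1:
        raise ValueError(f"no {protease} site in sequence")
    cut = start + offset
    return fusion_seq[:cut], fusion_seq[cut:]


# Hypothetical sequences, for illustration only.
tag = "MSPILGYWKIK"   # stand-in for a fusion moiety
target = "MAAPLSEQ"   # stand-in for the recombinant protein
fusion = tag + "IEGR" + target

n_frag, c_frag = cleave_fusion(fusion, "factor_xa")
print(n_frag)  # fusion moiety plus the recognition motif
print(c_frag)  # released recombinant protein
```

In this sketch the site is placed so that cleavage leaves no extra residues on the released protein; a thrombin site, by contrast, would leave Gly-Ser at its amino terminus.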

Purified fusion proteins can be utilized in AS3 activity assays (e.g., direct assays or 
competitive assays described in detail below), or to generate antibodies specific for AS3 
proteins, for example. In a preferred embodiment, an AS3 fusion protein expressed in a 
retroviral expression vector of the present invention can be utilized to infect bone marrow 
cells which are subsequently transplanted into irradiated recipients. The pathology of the 
subject recipient is then examined after sufficient time has passed (e.g., six (6) weeks). 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
(Amann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al., Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 
60-89). Target gene expression from the pTrc vector relies on host RNA polymerase 
transcription from a hybrid trp-lac fusion promoter. Target gene expression from the pET 
11d vector relies on transcription from a T7 gn10-lac fusion promoter mediated by a 
coexpressed viral RNA polymerase (T7 gn1). This viral polymerase is supplied by host 
strains BL21(DE3) or HMS174(DE3) from a resident prophage harboring a T7 gn1 gene 
under the transcriptional control of the lacUV5 promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express the 
protein in a host bacterium with an impaired capacity to proteolytically cleave the recombinant 
protein (Gottesman, S., Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 119-128). Another strategy is to alter the 
nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the 
individual codons for each amino acid are those preferentially utilized in E. coli (Wada et 
al., (1992) Nucleic Acids Res. 20:2111-2118). Such alteration of nucleic acid sequences of 
the invention can be carried out by standard DNA synthesis techniques. 
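
The codon-replacement strategy described above can be sketched, under stated assumptions, as a synonymous recoding of a coding sequence. In the following Python sketch the standard genetic code is built from its conventional compact representation, while the `PREFERRED` table is an illustrative assumption only and would in practice be replaced by a measured codon usage table such as that of Wada et al.; the input sequence is a hypothetical placeholder, not an AS3 sequence.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Illustrative (assumed) choice of one preferred codon per amino acid.
PREFERRED = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
    "*": "TAA",
}
# Sanity check: every preferred codon must encode its own amino acid.
assert all(CODON_TABLE[codon] == aa for aa, codon in PREFERRED.items())


def translate(cds):
    """Translate a coding sequence (length divisible by 3) to protein."""
    cds = cds.upper()
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def optimize_codons(cds):
    """Replace each codon with the preferred synonymous codon."""
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(PREFERRED[aa] for aa in translate(cds))


original = "ATGCTAAGGCCCTGA"            # hypothetical fragment: M L R P stop
recoded = optimize_codons(original)
print(recoded)
print(translate(original) == translate(recoded))  # encoded protein unchanged
```

Because only synonymous codons are substituted, the encoded amino acid sequence is unchanged; only the nucleotide sequence submitted for synthesis differs.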

In another embodiment, the AS3 expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerevisiae include pYepSec1 (Baldari, et al., 
(1987) EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), 
pJRY88 (Schultz et al., (1987) Gene 54:113-123), pYES2 (Invitrogen Corporation, San 
Diego, CA), and picZ (Invitrogen Corp., San Diego, CA). 

Alternatively, AS3 proteins can be expressed in insect cells using baculovirus 
expression vectors. Baculovirus vectors available for expression of proteins in cultured 
insect cells (e.g., Sf9 cells) include the pAc series (Smith et al. (1983) Mol. Cell. Biol. 
3:2156-2165) and the pVL series (Luckow and Summers (1989) Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC 
(Kaufman et al. (1987) EMBO J. 6:187-195). When used in mammalian cells, the expression 
vector's control functions are often provided by viral regulatory elements. For example, 
commonly used promoters are derived from polyoma, Adenovirus 2, cytomegalovirus and 
Simian Virus 40. For other suitable expression systems for both prokaryotic and eukaryotic 
cells see chapters 16 and 17 of Sambrook, J., Fritsch, E. F., and Maniatis, T. Molecular 
Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 

In another embodiment, the recombinant mammalian expression vector is capable of 
directing expression of the nucleic acid preferentially in a particular cell type (e.g., 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific regulatory 
elements are known in the art. Non-limiting examples of suitable tissue-specific promoters 
include the prostate-specific promoter (Gotoh et al. (1998) J. Urol. 60:220-229), the albumin 
promoter (liver-specific; Pinkert et al. (1987) Genes Dev. 1:268-277), lymphoid-specific 
promoters (Calame and Eaton (1988) Adv. Immunol. 43:235-275), in particular promoters of 
T cell receptors (Winoto and Baltimore (1989) EMBO J. 8:729-733) and immunoglobulins 
(Banerji et al. (1983) Cell 33:729-740; Queen and Baltimore (1983) Cell 33:741-748), 
neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle (1989) 
Proc. Natl. Acad. Sci. USA 86:5473-5477), pancreas-specific promoters (Edlund et al. 
(1985) Science 230:912-916), and mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,873,316 and European Application Publication No. 264,166). 
Developmentally-regulated promoters are also encompassed, for example the murine hox 
promoters (Kessel and Gruss (1990) Science 249:374-379) and the α-fetoprotein promoter 
(Camper and Tilghman (1989) Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. That 
is, the DNA molecule is operatively linked to a regulatory sequence in a manner which 
allows for expression (by transcription of the DNA molecule) of an RNA molecule which is 
antisense to AS3 mRNA. Regulatory sequences operatively linked to a nucleic acid cloned 
in the antisense orientation can be chosen which direct the continuous expression of the 
antisense RNA molecule in a variety of cell types, for instance viral promoters and/or 
enhancers, or regulatory sequences can be chosen which direct constitutive, tissue specific or 
cell type specific expression of antisense RNA. The antisense expression vector can be in 
the form of a recombinant plasmid, phagemid or attenuated virus in which antisense nucleic 
acids are produced under the control of a high efficiency regulatory region, the activity of 
which can be determined by the cell type into which the vector is introduced. For a 
discussion of the regulation of gene expression using antisense genes see Weintraub, H. et 
al., Antisense RNA as a molecular tool for genetic analysis, Reviews - Trends in Genetics, 
Vol. 1(1) 1986. 
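
As a minimal sketch of what cloning in the antisense orientation implies at the sequence level, the following Python example derives the RNA that would be transcribed from an insert placed in reverse orientation relative to the promoter, and confirms that it is complementary to the sense transcript. The function names and the short sequence are hypothetical and are used for illustration only; the sequence is not an AS3 sequence.

```python
# Watson-Crick complement map for DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def antisense_rna(sense_dna):
    """RNA produced when the insert is transcribed in reverse orientation."""
    return reverse_complement(sense_dna).replace("T", "U")


sense = "ATGGCCTTAG"                 # hypothetical coding-strand fragment
mrna = sense.replace("T", "U")       # sense transcript, 5'->3'
anti = antisense_rna(sense)          # antisense transcript, 5'->3'

print(mrna)   # AUGGCCUUAG
print(anti)   # CUAAGGCCAU

# Each base of the mRNA pairs with the antisense strand read 3'->5'.
PAIRS = {"A": "U", "U": "A", "G": "C", "C": "G"}
print(all(PAIRS[m] == a for m, a in zip(mrna, reversed(anti))))  # True
```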

V. Host Cells 

Another aspect of the invention pertains to host cells into which an AS3 nucleic acid 
molecule of the invention is introduced, e.g., an AS3 nucleic acid molecule within a 
recombinant expression vector or an AS3 nucleic acid molecule containing sequences which 
allow it to homologously recombine into a specific site of the host cell's genome. The terms 
"host cell" and "recombinant host cell" are used interchangeably herein. It is understood 
that such terms refer not only to the particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications may occur in succeeding generations 
due to either mutation or environmental influences, such progeny may not, in fact, be 
identical to the parent cell, but are still included within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, an AS3 protein 
can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells 
(such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are 
known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
transformation or transfection techniques. As used herein, the terms "transformation" and 
"transfection" are intended to refer to a variety of art-recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium 
chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or 
electroporation. Suitable methods for transforming or transfecting host cells can be found in 
Sambrook, et al. (Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989), and 
other laboratory manuals. 




For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these integrants, 
a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally 
introduced into the host cells along with the gene of interest. Preferred selectable markers 
include those which confer resistance to drugs, such as G418, hygromycin and methotrexate. 
Nucleic acid encoding a selectable marker can be introduced into a host cell on the same 
vector as that encoding an AS3 protein or can be introduced on a separate vector. Cells 
stably transfected with the introduced nucleic acid can be identified by drug selection (e.g., 
cells that have incorporated the selectable marker gene will survive, while the other cells 
die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, 
can be used to produce (i.e., express) an AS3 protein. Accordingly, the invention further 
provides methods for producing an AS3 protein using the host cells of the invention. In one 
embodiment, the method comprises culturing the host cell of the invention (into which a 
recombinant expression vector encoding an AS3 protein has been introduced) in a suitable 
medium such that an AS3 protein is produced. In another embodiment, the method further 
comprises isolating an AS3 protein from the medium or the host cell. 

VI. Transgenic Animals 

The host cells of the invention can also be used to produce non-human transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte 
or an embryonic stem cell into which AS3-coding sequences have been introduced. Such 
host cells can then be used to create non-human transgenic animals in which exogenous AS3 
sequences have been introduced into their genome or homologous recombinant animals in 
which endogenous AS3 sequences have been altered. Such animals are useful for studying 
the function and/or activity of an AS3 and for identifying and/or evaluating modulators of 
AS3 activity. As used herein, a "transgenic animal" is a non-human animal, preferably a 
mammal, more preferably a rodent such as a rat or mouse, in which one or more of the cells 
of the animal includes a transgene. Other examples of transgenic animals include non- 
human primates, sheep, dogs, cows, goats, chickens, amphibians, and the like. A transgene 
is exogenous DNA which is integrated into the genome of a cell from which a transgenic 
animal develops and which remains in the genome of the mature animal, thereby directing 
the expression of an encoded gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, a "homologous recombinant animal" is a non-human 
animal, preferably a mammal, more preferably a mouse, in which an endogenous AS3 gene 
has been altered by homologous recombination between the endogenous gene and an 
exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic cell of the 
animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing an AS3-encoding 
nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by microinjection or retroviral 
infection, and allowing the oocyte to develop in a pseudopregnant female foster animal. The 
AS3 cDNA sequence of SEQ ID NO:1 can be introduced as a transgene into the genome of 
a non-human animal. Alternatively, a nonhuman homologue of a human AS3 gene, such as 
a mouse or rat AS3 gene, can be used as a transgene. Alternatively, an AS3 gene 
homologue, such as another AS3 family member, can be isolated based on hybridization to 
the AS3 cDNA sequences of SEQ ID NO:1 or 3, or the DNA insert of the plasmid deposited 
with ATCC as Accession Number ______ (described further in subsection I above) and used 
as a transgene. Intronic sequences and polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the transgene. A tissue-specific 
regulatory sequence(s) can be operably linked to an AS3 transgene to direct expression of an 
AS3 protein to particular cells. Methods for generating transgenic animals via embryo 
manipulation and microinjection, particularly animals such as mice, have become 
conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 and 
4,870,009, both by Leder et al., U.S. Patent No. 4,873,191 by Wagner et al. and in Hogan, 
B., Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y., 1986). Similar methods are used for production of other transgenic animals. 
A transgenic founder animal can be identified based upon the presence of an AS3 transgene 
in its genome and/or expression of AS3 mRNA in tissues or cells of the animals. A 
transgenic founder animal can then be used to breed additional animals carrying the 
transgene. Moreover, transgenic animals carrying a transgene encoding an AS3 protein can 
further be bred to other transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains at 
least a portion of an AS3 gene into which a deletion, addition or substitution has been 
introduced to thereby alter, e.g., functionally disrupt, the AS3 gene. The AS3 gene can be a 
human gene (e.g., the cDNA of SEQ ID NO:3), but more preferably, is a non-human 
homologue of a human AS3 gene (e.g., a cDNA isolated by stringent hybridization with the 
nucleotide sequence of SEQ ID NO:1). For example, a mouse AS3 gene can be used to 
construct a homologous recombination nucleic acid molecule, e.g., a vector, suitable for 
altering an endogenous AS3 gene in the mouse genome. In a preferred embodiment, the 
homologous recombination nucleic acid molecule is designed such that, upon homologous 
recombination, the endogenous AS3 gene is functionally disrupted (i.e., no longer encodes a 
functional protein; also referred to as a "knock out" vector). Alternatively, the homologous 
recombination nucleic acid molecule can be designed such that, upon homologous 
recombination, the endogenous AS3 gene is mutated or otherwise altered but still encodes 
functional protein (e.g., the upstream regulatory region can be altered to thereby alter the 
expression of the endogenous AS3 protein). In the homologous recombination nucleic acid 
molecule, the altered portion of the AS3 gene is flanked at its 5' and 3' ends by additional 
nucleic acid sequence of the AS3 gene to allow for homologous recombination to occur 
between the exogenous AS3 gene carried by the homologous recombination nucleic acid 
molecule and an endogenous AS3 gene in a cell, e.g., an embryonic stem cell. The 
additional flanking AS3 nucleic acid sequence is of sufficient length for successful 
homologous recombination with the endogenous gene. Typically, several kilobases of 
flanking DNA (both at the 5' and 3' ends) are included in the homologous recombination 
nucleic acid molecule (see, e.g., Thomas, K.R. and Capecchi, M. R. (1987) Cell 51:503 for a 
description of homologous recombination vectors). The homologous recombination nucleic 
acid molecule is introduced into a cell, e.g., an embryonic stem cell line (e.g., by 
electroporation) and cells in which the introduced AS3 gene has homologously recombined 
with the endogenous AS3 gene are selected (see e.g., Li, E. et al. (1992) Cell 69:915). The 
selected cells can then be injected into a blastocyst of an animal (e.g., a mouse) to form 
aggregation chimeras (see e.g., Bradley, A. in Teratocarcinomas and Embryonic Stem Cells: 
A Practical Approach, E.J. Robertson, ed. (IRL, Oxford, 1987) pp. 113-152). A chimeric 
embryo can then be implanted into a suitable pseudopregnant female foster animal and the 
embryo brought to term. Progeny harboring the homologously recombined DNA in their 
germ cells can be used to breed animals in which all cells of the animal contain the 
homologously recombined DNA by germline transmission of the transgene. Methods for 
constructing homologous recombination nucleic acid molecules, e.g., vectors, or 
homologous recombinant animals are described further in Bradley, A. (1991) Current 
Opinion in Biotechnology 2:823-829 and in PCT International Publication Nos.: WO 
90/11354 by Le Mouellec et al.; WO 91/01140 by Smithies et al.; WO 92/0968 by Zijlstra et 
al.; and WO 93/04169 by Berns et al. 

In another embodiment, transgenic non-human animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage P1. For a 
description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc. Natl. 
Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the FLP 
recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 
251:1351-1355). If a cre/loxP recombinase system is used to regulate expression of the 
transgene, animals containing transgenes encoding both the Cre recombinase and a selected 
protein are required. Such animals can be provided through the construction of "double" 
transgenic animals, e.g., by mating two transgenic animals, one containing a transgene 
encoding a selected protein and the other containing a transgene encoding a recombinase. 
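
By way of a worked example only, the expected proportion of "double" transgenic offspring from such a mating can be estimated by enumerating the gametes of each parent. The Python sketch below assumes, purely for illustration, that each parent is hemizygous for a single transgene insertion, that the two insertions are unlinked, and that inheritance is Mendelian; none of these assumptions is stated in the specification, and the function name is hypothetical.

```python
from fractions import Fraction
from itertools import product


def double_transgenic_fraction():
    """Expected fraction of offspring carrying both transgenes.

    Assumes one hemizygous recombinase parent crossed with one hemizygous
    selected-protein parent (illustrative assumption).
    """
    recombinase_gametes = [True, False]  # parent 1 transmits the recombinase transgene or not
    target_gametes = [True, False]       # parent 2 transmits the selected-protein transgene or not
    offspring = list(product(recombinase_gametes, target_gametes))
    doubles = [o for o in offspring if all(o)]
    return Fraction(len(doubles), len(offspring))


print(double_transgenic_fraction())  # 1/4
```

Under these assumptions about one quarter of the progeny would be expected to carry both transgenes, so litters would ordinarily be genotyped to identify them.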

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, I. et al. (1997) Nature 385:810-813 and PCT 
International Publication Nos. WO 97/07668 and WO 97/07669. In brief, a cell, e.g., a 
somatic cell, from the transgenic animal can be isolated and induced to exit the growth cycle 
and enter G0 phase. The quiescent cell can then be fused, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the same species from which the quiescent 
cell is isolated. The reconstructed oocyte is then cultured such that it develops to morula or 
blastocyst and then transferred to a pseudopregnant female foster animal. The offspring borne 
of this female foster animal will be a clone of the animal from which the cell, e.g., the 
somatic cell, is isolated. 

VII. Pharmaceutical Compositions 

The AS3 nucleic acid molecules, fragments of AS3 proteins, and anti-AS3 
antibodies (also referred to herein as "active compounds") of the invention can be 
incorporated into pharmaceutical compositions suitable for administration. Such 
compositions typically comprise the nucleic acid molecule, protein, or antibody and a 
pharmaceutically acceptable carrier. As used herein the language "pharmaceutically 
acceptable carrier" is intended to include any and all solvents, dispersion media, coatings, 
antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. The use of such media and agents for 
pharmaceutically active substances is well known in the art. Except insofar as any 
conventional media or agent is incompatible with the active compound, use thereof in the 
compositions is contemplated. Supplementary active compounds can also be incorporated 
into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible with 
its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal 
(topical), transmucosal, and rectal administration. Solutions or suspensions used for 
parenteral, intradermal, or subcutaneous application can include the following components: 
a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, 
glycerine, propylene glycol or other synthetic solvents; antibacterial agents such as benzyl 
alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating 
agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or 
phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose. 
pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The 
parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose 
vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, 
Parsippany, NJ) or phosphate buffered saline (PBS). In all cases, the composition must be 
sterile and should be fluid to the extent that easy syringability exists. It must be stable under 
the conditions of manufacture and storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The carrier can be a solvent or 
dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures 
thereof. The proper fluidity can be maintained, for example, by the use of a coating such as 
lecithin, by the maintenance of the required particle size in the case of dispersion and by the 
use of surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic 
acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
for example, sugars, polyalcohols such as mannitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about by 
including in the composition an agent which delays absorption, for example, aluminum 
monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 
(e.g., a fragment of an AS3 protein or an anti-AS3 antibody) in the required amount in an 
appropriate solvent with one or a combination of ingredients enumerated above, as required, 
followed by filtered sterilization. Generally, dispersions are prepared by incorporating the 
active compound into a sterile vehicle which contains a basic dispersion medium and the 
required other ingredients from those enumerated above. In the case of sterile powders for 
the preparation of sterile injectable solutions, the preferred methods of preparation are
vacuum drying and freeze-drying, which yield a powder of the active ingredient plus any
additional desired ingredient from a previously sterile-filtered solution thereof.

Oral compositions generally include an inert diluent or an edible carrier. They can 
be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is
applied orally and swished and expectorated or swallowed. Pharmaceutically compatible
binding agents and/or adjuvant materials can be included as part of the composition. The
tablets, pills, capsules, troches and the like can contain any of the following ingredients, or
compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth
or gelatin; an excipient such as starch or lactose; a disintegrating agent such as alginic acid,
Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a glidant such
as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring
agent such as peppermint, methyl salicylate, or orange flavoring.

For administration by inhalation, the compounds are delivered in the form of an
aerosol spray from a pressurized container or dispenser which contains a suitable propellant,
e.g., a gas such as carbon dioxide, or a nebulizer.

Systemic administration can also be by transmucosal or transdermal means. For
transmucosal or transdermal administration, penetrants appropriate to the barrier to be
permeated are used in the formulation. Such penetrants are generally known in the art, and
include, for example, for transmucosal administration, detergents, bile salts, and fusidic acid
derivatives. Transmucosal administration can be accomplished through the use of nasal
sprays or suppositories. For transdermal administration, the active compounds are
formulated into ointments, salves, gels, or creams as generally known in the art.

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will protect
the compound against rapid elimination from the body, such as a controlled release
formulation, including implants and microencapsulated delivery systems. Biodegradable,
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides,
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of
such formulations will be apparent to those skilled in the art. The materials can also be
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to
viral antigens) can also be used as pharmaceutically acceptable carriers. These can be
prepared according to methods known to those skilled in the art, for example, as described in
U.S. Patent No. 4,522,811.

It is especially advantageous to formulate oral or parenteral compositions in dosage
unit form for ease of administration and uniformity of dosage. Dosage unit form as used
herein refers to physically discrete units suited as unitary dosages for the subject to be
treated, each unit containing a predetermined quantity of active compound calculated to
produce the desired therapeutic effect in association with the required pharmaceutical
carrier. The specifications for the dosage unit forms of the invention are dictated by and
directly dependent on the unique characteristics of the active compound and the particular
therapeutic effect to be achieved, and the limitations inherent in the art of compounding such
an active compound for the treatment of individuals.

Toxicity and therapeutic efficacy of such compounds can be determined by standard
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the
LD50 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically
effective in 50% of the population). The dose ratio between toxic and therapeutic effects is
the therapeutic index and it can be expressed as the ratio LD50/ED50. Compounds which
exhibit large therapeutic indices are preferred. While compounds that exhibit toxic side
effects may be used, care should be taken to design a delivery system that targets such
compounds to the site of affected tissue in order to minimize potential damage to uninfected
cells and, thereby, reduce side effects.
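
The ratio LD50/ED50 described above can be illustrated with a short calculation. The following is only a sketch: the function name and the dose values are hypothetical and are not data from this specification.

```python
def therapeutic_index(ld50, ed50):
    """Return the therapeutic index, defined in the text as LD50 / ED50.

    Both doses must be positive and expressed in the same units
    (for example mg/kg); the units cancel in the ratio.
    """
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical doses for two made-up compounds (mg/kg), for illustration only.
compounds = {"compound_A": (500.0, 25.0), "compound_B": (300.0, 100.0)}

# A larger index means a wider margin between effective and lethal doses.
ranked = sorted(compounds, key=lambda n: therapeutic_index(*compounds[n]), reverse=True)
print(ranked)  # ['compound_A', 'compound_B']
```

Under these assumed values, compound_A (index 20) would be preferred over compound_B (index 3), consistent with the stated preference for large therapeutic indices.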

The data obtained from the cell culture assays and animal studies can be used in
formulating a range of dosage for use in humans. The dosage of such compounds lies
preferably within a range of circulating concentrations that include the ED50 with little or
no toxicity. The dosage may vary within this range depending upon the dosage form
employed and the route of administration utilized. For any compound used in the method of
the invention, the therapeutically effective dose can be estimated initially from cell culture
assays. A dose may be formulated in animal models to achieve a circulating plasma
concentration range that includes the IC50 (i.e., the concentration of the test compound
which achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such
information can be used to more accurately determine useful doses in humans. Levels in
plasma may be measured, for example, by high performance liquid chromatography.
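
As an illustration of how an IC50 might be read off cell culture data, the sketch below interpolates on a log-concentration scale between the two measured points that bracket 50% inhibition. The interpolation method, the function name, and the data points are assumptions made for this example; the specification does not prescribe a particular estimation procedure.

```python
import math


def estimate_ic50(concentrations, percent_inhibition):
    """Estimate IC50 by log-linear interpolation between bracketing points.

    concentrations must be positive; percent_inhibition is on a 0-100 scale.
    Returns None if the data never cross 50% inhibition.
    """
    pairs = sorted(zip(concentrations, percent_inhibition))
    for (c_lo, y_lo), (c_hi, y_hi) in zip(pairs, pairs[1:]):
        if y_lo <= 50.0 <= y_hi and y_hi > y_lo:
            frac = (50.0 - y_lo) / (y_hi - y_lo)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


# Hypothetical dose-response data (concentration in micromolar).
conc = [0.1, 1.0, 10.0, 100.0]
inhibition = [10.0, 30.0, 70.0, 90.0]
print(round(estimate_ic50(conc, inhibition), 2))  # 3.16
```

A curve fit (for example a four-parameter logistic model) would normally be used with real data; the interpolation here is only the simplest way to show the idea.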

VIII. Gene Therapy 

The nucleic acid molecules of the invention can be inserted into vectors and used as
gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for example,
intravenous injection, local administration (see U.S. Patent 5,328,470) or by stereotactic
injection (see e.g., Chen et al. (1994) Proc. Natl. Acad. Sci. USA 91:3054-3057). The
pharmaceutical preparation of the gene therapy vector can include the gene therapy vector in
an acceptable diluent, or can comprise a slow release matrix in which the gene delivery
vehicle is embedded. Alternatively, where the complete gene delivery vector can be
produced intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical
preparation can include one or more cells which produce the gene delivery system.

The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. 

IX. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein homologues, and antibodies described
herein can be used in one or more of the following methods: a) screening assays; b)
predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring clinical trials,
and pharmacogenetics); and c) methods of treatment (e.g., therapeutic and prophylactic). As
described herein, an AS3 protein of the invention modulates the arrest of cell proliferation,
preferably hormone-mediated cell proliferation.

Thus, the isolated nucleic acid molecules of the invention can be used, for example,
to express AS3 protein (e.g., via a recombinant expression vector in a host cell in gene
therapy applications), to detect AS3 mRNA (e.g., in a biological sample) or a genetic
alteration in an AS3 gene, and to modulate AS3 activity, as described further below. The
AS3 nucleic acid molecules can be used to treat disorders characterized by insufficient
production of AS3. The AS3 proteins can be used to screen for naturally occurring AS3
substrates, to screen for drugs or compounds which modulate AS3 activity, as well as to
treat disorders characterized by insufficient production of AS3 protein or production of AS3
protein forms which have decreased, aberrant or unwanted activity compared to AS3 wild
type protein (e.g., excessive cell proliferation). Moreover, the anti-AS3 antibodies of the
invention can be used to detect and isolate AS3 proteins, regulate the bioavailability of AS3
proteins, and modulate AS3 activity.

IX. A. Screening Assays

The invention provides a method (also referred to herein as a "screening assay") for
identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides,
peptidomimetics, small molecules or other drugs) which bind to AS3 proteins, have a
stimulatory or inhibitory effect on, for example, AS3 expression or AS3 activity, or have a
stimulatory or inhibitory effect on, for example, the expression or activity of AS3 substrate.

In one embodiment, the invention provides assays for screening candidate or test
compounds which are substrates of an AS3 protein or polypeptide or biologically active
portion thereof. In another embodiment, the invention provides assays for screening
candidate or test compounds which bind to or modulate the activity of an AS3 protein or
polypeptide or biologically active portion thereof. The test compounds of the present
invention can be obtained using any of the numerous approaches in combinatorial library
methods known in the art, including: biological libraries; spatially addressable parallel solid
phase or solution phase libraries; synthetic library methods requiring deconvolution; the
'one-bead one-compound' library method; and synthetic library methods using affinity
chromatography selection. The biological library approach is limited to peptide libraries,
while the other four approaches are applicable to peptide, non-peptide oligomer or small
molecule libraries of compounds (Lam, K.S. (1997) Anticancer Drug Des. 12:145).

Examples of methods for the synthesis of molecular libraries can be found in the art,
for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et al.
(1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994) J. Med. Chem.
37:2678; Cho et al. (1993) Science 261:1303; Carell et al. (1994) Angew. Chem. Int. Ed.
Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in Gallop et
al. (1994) J. Med. Chem. 37:1233.

Libraries of compounds may be presented in solution (e.g., Houghten (1992)
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP '409),
plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci. USA 89:1865-1869) or on phage (Scott and
Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404-406); (Cwirla et al.
(1990) Proc. Natl. Acad. Sci. 87:6378-6382); (Felici (1991) J. Mol. Biol. 222:301-310);
(Ladner supra).

In one embodiment, an assay is a cell-based assay in which a cell which expresses an
AS3 protein or biologically active portion thereof is contacted with a test compound and the
ability of the test compound to modulate AS3 activity is determined. Determining the
ability of the test compound to modulate AS3 activity can be accomplished by monitoring, for
example, changes in the cell using standard techniques.

In another embodiment, an assay of the present invention is a cell-free assay in
which an AS3 protein or biologically active portion thereof is contacted with a test
compound and the ability of the test compound to bind to the AS3 protein or biologically
active portion thereof is determined. Preferred biologically active portions of the AS3
proteins to be used in assays of the present invention include fragments which participate in
interactions with non-AS3 molecules, e.g., fragments with high surface probability scores
(see, for example, Figure 2). Binding of the test compound to the AS3 protein can be
determined either directly or indirectly as described above. In a preferred embodiment, the
assay includes contacting the AS3 protein or biologically active portion thereof with a
known compound which binds AS3 to form an assay mixture, contacting the assay mixture
with a test compound, and determining the ability of the test compound to interact with an
AS3 protein, wherein determining the ability of the test compound to interact with an AS3
protein comprises determining the ability of the test compound to preferentially bind to AS3
or biologically active portion thereof as compared to the known compound.

In yet another embodiment, the cell-free assay involves contacting an AS3 protein or
biologically active portion thereof with a known compound which binds the AS3 protein to
form an assay mixture, contacting the assay mixture with a test compound, and determining
the ability of the test compound to interact with the AS3 protein, wherein determining the
ability of the test compound to interact with the AS3 protein comprises determining the
ability of the AS3 protein to preferentially bind to or modulate the activity of an AS3 target
molecule.

In more than one embodiment of the above assay methods of the present invention, it
may be desirable to immobilize AS3 to facilitate separation of proteins that interact with
AS3, as well as to accommodate automation of the assay. Binding of a test compound to an
AS3 protein, or interaction of an AS3 protein with a target molecule in the presence and
absence of a candidate compound, can be accomplished in any vessel suitable for containing
the reactants. Examples of such vessels include microtitre plates, test tubes, and
microcentrifuge tubes. In one embodiment, a fusion protein can be provided which adds a
domain that allows one or both of the proteins to be bound to a matrix. For example,
glutathione-S-transferase/AS3 fusion proteins or glutathione-S-transferase/target fusion
proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO)
or glutathione derivatized microtitre plates, which are then combined with the test compound
or the test compound and either the non-adsorbed target protein or AS3 protein, and the
mixture incubated under conditions conducive to complex formation (e.g., at physiological
conditions for salt and pH). Following incubation, the beads or microtitre plate wells are
washed to remove any unbound components, the matrix is immobilized in the case of beads,
and complex formation is determined either directly or indirectly, for example, as described
above. Alternatively, the complexes can be dissociated from the matrix, and the level of AS3
binding or activity determined using standard techniques.

Other techniques for immobilizing proteins on matrices can also be used in the
screening assays of the invention. For example, either an AS3 protein or an AS3 target
molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated
AS3 protein or target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide)
using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL),
and immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical).
Alternatively, antibodies reactive with AS3 protein or target molecules but which do not
interfere with binding of the AS3 protein to its target molecule can be derivatized to the
wells of the plate, and unbound target or AS3 protein trapped in the wells by antibody
conjugation. Methods for detecting such complexes, in addition to those described above
for the GST-immobilized complexes, include immunodetection of complexes using
antibodies reactive with the AS3 protein or target molecule, as well as enzyme-linked assays
which rely on detecting an enzymatic activity associated with the AS3 protein or target
molecule.

In another embodiment, modulators of AS3 expression are identified in a method
wherein a cell is contacted with a candidate compound and the expression of AS3 mRNA or
protein in the cell is determined. The level of expression of AS3 mRNA or protein in the
presence of the candidate compound is compared to the level of expression of AS3 mRNA
or protein in the absence of the candidate compound. The candidate compound can then be
identified as a modulator of AS3 expression based on this comparison. For example, when
expression of AS3 mRNA or protein is greater (statistically significantly greater) in the
presence of the candidate compound than in its absence, the candidate compound is
identified as a stimulator of AS3 mRNA or protein expression. Alternatively, when
expression of AS3 mRNA or protein is less (statistically significantly less) in the presence of
the candidate compound than in its absence, the candidate compound is identified as an
inhibitor of AS3 mRNA or protein expression. This assay may be further modified to
include the presence of a hormone, e.g., an androgen or an anti-androgen. The level of AS3
mRNA or protein expression in the cells can be determined by methods described herein for
detecting AS3 mRNA or protein.
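
The comparison described above requires some test of statistical significance, which the text does not specify. The sketch below assumes an exact two-sided permutation test on the difference in mean expression between replicate measurements; the function names, the 0.05 threshold, and the expression values are all hypothetical choices made for illustration.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, control):
    """Exact two-sided permutation p-value for the difference in means."""
    pooled = list(treated) + list(control)
    n = len(treated)
    total = sum(pooled)
    observed = abs(mean(treated) - mean(control))
    hits = 0
    count = 0
    # Enumerate every way of relabelling n of the pooled values as "treated".
    for idx in combinations(range(len(pooled)), n):
        group = [pooled[i] for i in idx]
        rest_mean = (total - sum(group)) / (len(pooled) - n)
        if abs(mean(group) - rest_mean) >= observed - 1e-12:
            hits += 1
        count += 1
    return hits / count


def classify_candidate(treated, control, alpha=0.05):
    """Label a candidate compound from replicate expression levels."""
    if permutation_p_value(treated, control) >= alpha:
        return "no significant effect"
    return "stimulator" if mean(treated) > mean(control) else "inhibitor"


# Hypothetical relative AS3 mRNA levels with and without the candidate compound.
with_compound = [2.1, 2.4, 2.2, 2.5]
without_compound = [1.0, 1.1, 0.9, 1.2]
print(classify_candidate(with_compound, without_compound))  # stimulator
```

With four replicates per group the smallest attainable two-sided p-value is 2/70, so at least four replicates per condition are needed for this particular test to reach the assumed 0.05 threshold.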

In yet another aspect of the invention, the AS3 proteins can be used as "bait proteins"
in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 5,283,317; Zervos et
al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 268:12046-12054; Bartel et
al. (1993) Biotechniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696; and
Brent WO94/10300), to identify other proteins, which bind to or interact with AS3
("AS3-binding proteins" or "AS3-bp") and are involved in AS3 activity. Such AS3-binding
proteins are also likely to be involved in the propagation of signals by the AS3 proteins or
AS3 targets as, for example, downstream elements of an AS3-mediated signaling pathway.
Alternatively, such AS3-binding proteins are likely to be AS3 inhibitors.

The two-hybrid system is based on the modular nature of most transcription factors,
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes
two different DNA constructs. In one construct, the gene that codes for an AS3 protein is
fused to a gene encoding the DNA binding domain of a known transcription factor (e.g.,
GAL-4). In the other construct, a DNA sequence, from a library of DNA sequences, that
encodes an unidentified protein ("prey" or "sample") is fused to a gene that codes for the
activation domain of the known transcription factor. If the "bait" and the "prey" proteins are
able to interact, in vivo, forming an AS3-dependent complex, the DNA-binding and
activation domains of the transcription factor are brought into close proximity. This
proximity allows transcription of a reporter gene (e.g., LacZ) which is operably linked to a
transcriptional regulatory site responsive to the transcription factor. Expression of the
reporter gene can be detected and cell colonies containing the functional transcription factor
can be isolated and used to obtain the cloned gene which encodes the protein which interacts
with the AS3 protein.

In another aspect, the invention pertains to a combination of two or more of the
assays described herein. For example, a modulating agent can be identified using a
cell-based or a cell-free assay, and the ability of the agent to modulate the activity of an AS3
protein can be confirmed in vivo, e.g., in an animal such as an animal model for pain.

This invention further pertains to novel agents identified by the above-described
screening assays. Accordingly, it is within the scope of this invention to further use an agent
identified as described herein in an appropriate animal model. For example, an agent
identified as described herein (e.g., an AS3 modulating agent, an antisense AS3 nucleic acid
molecule, an AS3-specific antibody, or an AS3-binding partner) can be used in an animal
model to determine the efficacy, toxicity, or side effects of treatment with such an agent.
Alternatively, an agent identified as described herein can be used in an animal model to
determine the mechanism of action of such an agent. Furthermore, this invention pertains to
uses of novel agents identified by the above-described screening assays for treatments as
described herein.

IX. B. Detection Assays

Portions or fragments of the cDNA sequences identified herein (and the
corresponding complete gene sequences) can be used in numerous ways as polynucleotide
reagents. For example, these sequences can be used to: (i) map their respective genes on a
chromosome; and, thus, locate gene regions associated with genetic disease; (ii) identify an
individual from a minute biological sample (tissue typing); and (iii) aid in forensic
identification of a biological sample. These applications are described in the subsections
below.

IX. B. 1. Chromosome Mapping

Once the sequence (or a portion of the sequence) of a gene has been isolated, this
sequence can be used to map the location of the gene on a chromosome. This process is
called chromosome mapping. Accordingly, portions or fragments of the AS3 nucleotide
sequences, described herein, can be used to map the location of the AS3 genes on a
chromosome as described in Example 4. The mapping of the AS3 sequences to
chromosomes is an important step in correlating these sequences with genes associated with
disease.

Moreover, differences in the DNA sequences between individuals affected and
unaffected with a disease associated with the AS3 gene, can be determined. If a mutation is
observed in some or all of the affected individuals but not in any unaffected individuals,
then the mutation is likely to be the causative agent of the particular disease. Comparison of
affected and unaffected individuals generally involves first looking for structural alterations
in the chromosomes, such as deletions or translocations that are visible from chromosome
spreads or detectable using PCR based on that DNA sequence. Ultimately, complete
sequencing of genes from several individuals can be performed to confirm the presence of a
mutation and to distinguish mutations from polymorphisms.
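
The selection rule stated above (a variant seen in some or all affected individuals and in no unaffected individual) amounts to a set difference. The sketch below assumes each individual's variants have already been called and are represented as a set of labels; the function name and the variant labels are hypothetical.

```python
def candidate_mutations(affected, unaffected):
    """Return variants seen in at least one affected and in no unaffected individual.

    affected and unaffected are lists of sets, one set of variant labels
    per individual.
    """
    seen_in_affected = set().union(*affected) if affected else set()
    seen_in_unaffected = set().union(*unaffected) if unaffected else set()
    return seen_in_affected - seen_in_unaffected


# Hypothetical variant calls for illustration only.
affected = [{"c.120G>A", "c.455C>T"}, {"c.120G>A"}]
unaffected = [{"c.455C>T"}, set()]
print(candidate_mutations(affected, unaffected))  # {'c.120G>A'}
```

Here c.455C>T is excluded because it also appears in an unaffected individual, which is the behavior expected of a polymorphism rather than a causative mutation.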

IX. B. 2. Tissue Typing

The AS3 sequences of the present invention can also be used to identify individuals
from minute biological samples. The United States military, for example, is considering the
use of restriction fragment length polymorphism (RFLP) for identification of its personnel.
In this technique, an individual's genomic DNA is digested with one or more restriction
enzymes, and probed on a Southern blot to yield unique bands for identification. This
method does not suffer from the current limitations of "Dog Tags" which can be lost,
switched, or stolen, making positive identification difficult. The sequences of the present
invention are useful as additional DNA markers for RFLP (described in U.S. Patent
5,272,057).
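
The digestion step of RFLP can be sketched as follows, assuming a linear sequence and a single enzyme. The EcoRI recognition site GAATTC (cut after the first G) is used purely as an example; the toy sequences and the function name `digest` are hypothetical and are not taken from this specification. A real Southern blot shows only the fragments that hybridize to the probe, which this sketch ignores.

```python
def digest(sequence, site="GAATTC", cut_offset=1):
    """Return fragment lengths after cutting a linear sequence at each site.

    cut_offset is the number of bases into the recognition site at which
    the cut is made (1 for EcoRI, which cuts G^AATTC).
    """
    fragments = []
    start = 0
    pos = sequence.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(cut - start)
        start = cut
        pos = sequence.find(site, pos + 1)
    fragments.append(len(sequence) - start)
    return fragments


# Two toy alleles: the second has lost one recognition site (GAATTC -> GAGTTC).
allele_a = "AAAA" + "GAATTC" + "CCCC" + "GAATTC" + "TT"
allele_b = "AAAA" + "GAATTC" + "CCCC" + "GAGTTC" + "TT"
print(digest(allele_a))  # [5, 10, 7]
print(digest(allele_b))  # [5, 17]
```

The single base difference changes the fragment lengths, which is what produces distinguishable band patterns between individuals.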

Furthermore, the sequences of the present invention can be used to provide an
alternative technique which determines the actual base-by-base DNA sequence of selected
portions of an individual's genome. Thus, the AS3 nucleotide sequences described herein
can be used to prepare two PCR primers from the 5' and 3' ends of the sequences. These
primers can then be used to amplify an individual's DNA and subsequently sequence it.
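
Deriving a primer pair from the two ends of a known sequence can be sketched as below. The forward primer is taken directly from the 5' end, and the reverse primer is the reverse complement of the 3' end. The function names, the default primer length of 20 bases, and the example sequence are assumptions for illustration; practical primer design would also weigh melting temperature, GC content, and self-complementarity.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def end_primers(sequence, length=20):
    """Return (forward, reverse) primers taken from the 5' and 3' ends."""
    if len(sequence) < 2 * length:
        raise ValueError("sequence too short for two non-overlapping primers")
    forward = sequence[:length]
    reverse = reverse_complement(sequence[-length:])
    return forward, reverse


# Hypothetical 14-base sequence with 4-base primers, to keep the output readable.
print(end_primers("ATGCCGTTAGGCTA", length=4))  # ('ATGC', 'TAGC')
```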

Panels of corresponding DNA sequences from individuals, prepared in this manner,
can provide unique individual identifications, as each individual will have a unique set of
such DNA sequences due to allelic differences. The sequences of the present invention can
be used to obtain such identification sequences from individuals and from tissue. The AS3
nucleotide sequences of the invention uniquely represent portions of the human genome.
Allelic variation occurs to some degree in the coding regions of these sequences, and to a
greater degree in the noncoding regions. It is estimated that allelic variation between
individual humans occurs with a frequency of about once per each 500 bases. Each of the
sequences described herein can, to some degree, be used as a standard against which DNA
from an individual can be compared for identification purposes. Because greater numbers of
polymorphisms occur in the noncoding regions, fewer sequences are necessary to
differentiate individuals. The noncoding sequences of SEQ ID NO:1 can comfortably
provide positive individual identification with a panel of perhaps 10 to 1,000 primers which
each yield a noncoding amplified sequence of 100 bases. If predicted coding sequences,
such as those in SEQ ID NO:3 are used, a more appropriate number of primers for positive
individual identification would be 500-2,000.
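
The panel sizes quoted above can be related to the stated variation rate with simple arithmetic. The sketch below assumes the average figure of about one allelic variation per 500 bases applies uniformly, which is a simplification (the text notes that noncoding regions vary more than coding regions); the function name is hypothetical.

```python
def expected_variant_sites(n_primers, amplicon_length=100, bases_per_variant=500):
    """Expected number of variable positions covered by a primer panel.

    Assumes each primer yields amplicon_length bases and that variation
    occurs on average once per bases_per_variant bases.
    """
    return n_primers * amplicon_length / bases_per_variant


for panel_size in (10, 1000):
    print(panel_size, expected_variant_sites(panel_size))
# 10 2.0
# 1000 200.0
```

Under this assumption a 10-primer panel surveys about 1,000 bases and roughly 2 variable positions, while a 1,000-primer panel surveys about 100,000 bases and roughly 200.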

If a panel of reagents from AS3 nucleotide sequences described herein is used to
generate a unique identification database for an individual, those same reagents can later be
used to identify tissue from that individual. Using the unique identification database,
positive identification of the individual, living or dead, can be made from extremely small
tissue samples.

IX. B. 3. Use of Partial AS3 Sequences in Forensic Biology

DNA-based identification techniques can also be used in forensic biology. Forensic
biology is a scientific field employing genetic typing of biological evidence found at a crime
scene as a means for positively identifying, for example, a perpetrator of a crime. To make
such an identification, PCR technology can be used to amplify DNA sequences taken from
very small biological samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood,
saliva, or semen found at a crime scene. The amplified sequence can then be compared to a
standard, thereby allowing identification of the origin of the biological sample.

The sequences of the present invention can be used to provide polynucleotide
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can
enhance the reliability of DNA-based forensic identifications by, for example, providing
another "identification marker" (i.e., another DNA sequence that is unique to a particular
individual). As mentioned above, actual base sequence information can be used for
identification as an accurate alternative to patterns formed by restriction enzyme generated
fragments. Sequences targeted to noncoding regions of SEQ ID NO:1 are particularly
appropriate for this use as greater numbers of polymorphisms occur in the noncoding
regions, making it easier to differentiate individuals using this technique. Examples of
polynucleotide reagents include the AS3 nucleotide sequences or portions thereof, e.g.,
fragments derived from the noncoding regions of SEQ ID NO:1 having a length of at least
20 bases, preferably at least 30 bases.

The AS3 nucleotide sequences described herein can further be used to provide
polynucleotide reagents, e.g., labeled or labelable probes which can be used in, for example,
an in situ hybridization technique, to identify a specific tissue, e.g., brain tissue. This can be
very useful in cases where a forensic pathologist is presented with a tissue of unknown
origin. Panels of such AS3 probes can be used to identify tissue by species and/or by organ
type.

In a similar fashion, these reagents, e.g., AS3 primers or probes, can be used to screen
tissue culture for contamination (i.e., screen for the presence of a mixture of different types
of cells in a culture).

IX. C. Predictive Medicine

The present invention also pertains to the field of predictive medicine in which
diagnostic assays, prognostic assays, and monitoring clinical trials are used for prognostic
(predictive) purposes to thereby treat an individual prophylactically. Accordingly, one
aspect of the present invention relates to diagnostic assays for determining AS3 protein
and/or nucleic acid expression as well as AS3 activity, in the context of a biological sample
(e.g., blood, serum, cells, tissue) to thereby determine whether an individual is afflicted with
a disease or disorder, or is at risk of developing a disorder, associated with aberrant or
reduced AS3 expression or activity. The invention also provides for prognostic (or
predictive) assays for determining whether an individual is at risk of developing a disorder
associated with AS3 protein, nucleic acid expression or activity. For example, mutations in
an AS3 gene can be assayed in a biological sample. Such assays can be used for prognostic
or predictive purpose to thereby prophylactically treat an individual prior to the onset of a
disorder characterized by or associated with AS3 protein, nucleic acid expression, or
activity.

Another aspect of the invention pertains to monitoring the influence of agents (e.g., 
drugs, compounds) on the expression or activity of AS3 in clinical trials. 

These and other agents are described in further detail in the following sections and in
Examples 4 and 5.

IX. C. 1. Diagnostic Assays

An exemplary method for detecting the presence or absence of AS3 protein or
nucleic acid in a biological sample involves obtaining a biological sample from a test subject
and contacting the biological sample with a compound or an agent capable of detecting AS3
protein or nucleic acid (e.g., mRNA, genomic DNA) that encodes AS3 protein such that the
presence of AS3 protein or nucleic acid is detected in the biological sample. A preferred
agent for detecting AS3 mRNA or genomic DNA is a labeled nucleic acid probe capable of
hybridizing to AS3 mRNA or genomic DNA. The nucleic acid probe can be, for example, a
full-length AS3 nucleic acid, such as the nucleic acid of SEQ ID NO:1 or 3, or the DNA
insert of the plasmid deposited with ATCC as Accession Number ______, or a portion
thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in
length and sufficient to specifically hybridize under stringent conditions to AS3 mRNA or
genomic DNA. Other suitable probes for use in the diagnostic assays of the invention are
described herein.

A preferred agent for detecting AS3 protein is an antibody capable of binding to AS3
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or
more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or
F(ab')2) can be used. The term "labeled", with regard to the probe or antibody, is intended
to encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a
detectable substance to the probe or antibody, as well as indirect labeling of the probe or
antibody by reactivity with another reagent that is directly labeled. Examples of indirect
labeling include detection of a primary antibody using a fluorescently labeled secondary
antibody and end-labeling of a DNA probe with biotin such that it can be detected with
fluorescently labeled streptavidin. The term "biological sample" is intended to include
tissues, cells and biological fluids isolated from a subject, as well as tissues, cells and fluids
present within a subject. That is, the detection method of the invention can be used to detect
AS3 mRNA, protein, or genomic DNA in a biological sample in vitro as well as in vivo. For
example, in vitro techniques for detection of AS3 mRNA include Northern hybridizations
and in situ hybridizations. In vitro techniques for detection of AS3 protein include enzyme
linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations and
immunofluorescence. In vitro techniques for detection of AS3 genomic DNA include
Southern hybridizations.

In one embodiment, the biological sample contains protein molecules from the test
subject. Alternatively, the biological sample can contain mRNA molecules from the test
subject or genomic DNA molecules from the test subject. A preferred biological sample is a
tissue or cell sample isolated by conventional means from a subject.

In another embodiment, the methods further involve obtaining a control biological
sample from a control subject, contacting the control sample with a compound or agent
capable of detecting AS3 protein, mRNA, or genomic DNA, such that the presence of AS3
protein, mRNA or genomic DNA is detected in the biological sample, and comparing the
presence of AS3 protein, mRNA or genomic DNA in the control sample with the presence
of AS3 protein, mRNA or genomic DNA in the test sample.

The invention also encompasses kits for detecting the presence of AS3 in a
biological sample. For example, the kit can comprise a labeled compound or agent capable
of detecting AS3 protein or mRNA in a biological sample; means for determining the
amount of AS3 in the sample; and means for comparing the amount of AS3 in the sample
with a standard. The compound or agent can be packaged in a suitable container. The kit
can further comprise instructions for using the kit to detect AS3 protein or nucleic acid.

IX. C. 2. Prognostic Assays

The diagnostic methods described herein can furthermore be utilized to identify
subjects having or at risk of developing a disease or disorder associated with aberrant AS3
expression or activity. As used herein, the term "aberrant" includes an AS3 expression or
activity which deviates from the wild type AS3 expression or activity. Aberrant expression
or activity includes increased or decreased expression or activity, as well as expression or
activity which does not follow the wild type developmental pattern of expression or the
subcellular pattern of expression. For example, aberrant AS3 expression or activity is
intended to include the cases in which a mutation in the AS3 gene causes the AS3 gene to be
under-expressed or over-expressed and situations in which such mutations result in a
non-functional AS3 protein or a protein which does not function in a wild-type fashion, e.g., a
protein which does not interact with an AS3 ligand or one which interacts with a non-AS3
ligand.

The assays described herein, such as the preceding diagnostic assays or the following
assays, can be utilized to identify a subject having or at risk of developing a malignancy
associated with a misregulation in AS3 protein activity or nucleic acid expression, such as a
prostate cancer. Thus, the present invention provides a method for identifying a disease or
disorder associated with aberrant AS3 expression or activity in which a test sample is
obtained from a subject and AS3 protein or nucleic acid (e.g., mRNA or genomic DNA) is
detected, wherein the presence of AS3 protein or nucleic acid is diagnostic for a subject
having or at risk of developing a disease or disorder associated with aberrant or unwanted
AS3 expression or activity. As used herein, a "test sample" refers to a biological sample
obtained from a subject of interest. For example, a test sample can be a cell, tissue, or
biological fluid containing a cell.

Furthermore, the prognostic assays described herein can be used to determine
whether a subject can be administered an agent (e.g., an agonist, antagonist, peptidomimetic,
protein, peptide, nucleic acid, small molecule, or other drug candidate, such as a hormone,
e.g., an androgen) to treat a disease or disorder associated with aberrant AS3 expression or
activity. For example, such methods can be used to determine whether a subject can be
effectively treated with an agent for aberrant cell proliferation, e.g., cancer of the prostate.

The methods of the invention can also be used to detect genetic alterations in an AS3
gene, thereby determining if a subject with the altered gene is at risk for a disorder
characterized by misregulation in AS3 protein activity or nucleic acid expression, such as a
proliferative disorder, e.g., cancer. In preferred embodiments, the methods include
detecting, in a sample of cells from the subject, the presence or absence of a genetic
alteration characterized by at least one of an alteration affecting the integrity of a gene
encoding an AS3-protein, or the mis-expression of the AS3 gene. For example, such genetic
alterations can be detected by ascertaining the existence of at least one of 1) a deletion of
one or more nucleotides from an AS3 gene; 2) an addition of one or more nucleotides to an
AS3 gene; 3) a substitution of one or more nucleotides of an AS3 gene; 4) a chromosomal
rearrangement of an AS3 gene; 5) an alteration in the level of a messenger RNA transcript of
an AS3 gene; 6) aberrant modification of an AS3 gene, such as of the methylation pattern of
the genomic DNA; 7) the presence of a non-wild type splicing pattern of a messenger RNA
transcript of an AS3 gene; 8) a non-wild type level of an AS3-protein; 9) allelic loss of an
AS3 gene; and 10) inappropriate post-translational modification of an AS3-protein. As
described herein, there are a large number of assays known in the art which can be used for
detecting alterations in an AS3 gene. A preferred biological sample is a cell, tissue, or
biological fluid containing a cell, isolated by conventional means from a subject.

In certain embodiments, detection of the alteration involves the use of a probe/primer
in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 4,683,202),
such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR) (see,
e.g., Landegren et al. (1988) Science 241:1077-1080; and Nakazawa et al. (1994) Proc. Natl.
Acad. Sci. USA 91:360-364), the latter of which can be particularly useful for detecting point
mutations in the AS3-gene (see Abravaya et al. (1995) Nucleic Acids Res. 23:675-682).
This method can include the steps of collecting a sample of cells from a subject, isolating
nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, contacting the
nucleic acid sample with one or more primers which specifically hybridize to an AS3 gene
under conditions such that hybridization and amplification of the AS3-gene (if present)
occurs, and detecting the presence or absence of an amplification product, or detecting the
size of the amplification product and comparing the length to a control sample. It is
anticipated that PCR and/or LCR may be desirable to use as a preliminary amplification step
in conjunction with any of the techniques used for detecting mutations described herein.

Alternative amplification methods include: self sustained sequence replication
(Guatelli, J.C. et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional
amplification system (Kwoh, D.Y. et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173-1177),
Q-Beta Replicase (Lizardi, P.M. et al. (1988) Bio-Technology 6:1197), or any other nucleic
acid amplification method, followed by the detection of the amplified molecules using
techniques well known to those of skill in the art. These detection schemes are especially
useful for the detection of nucleic acid molecules if such molecules are present in very low
numbers.

In an alternative embodiment, mutations in an AS3 gene from a sample cell can be
identified by alterations in restriction enzyme cleavage patterns. For example, sample and
control DNA is isolated, amplified (optionally), digested with one or more restriction
endonucleases, and fragment lengths are determined by gel electrophoresis and
compared. Differences in fragment lengths between sample and control DNA indicate
mutations in the sample DNA. Moreover, sequence specific ribozymes (see, for
example, U.S. Patent No. 5,498,531) can be used to score for the presence of specific
mutations by development or loss of a ribozyme cleavage site.
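
By way of illustration only, the comparison of restriction fragment lengths between sample and control DNA can be sketched in silico. The Python sketch below assumes short hypothetical sequences (not derived from AS3) and the EcoRI recognition site GAATTC, cut after the first base; the function names are illustrative and the code models only the fragment-length bookkeeping, not a laboratory protocol.

```python
def digest(seq, site="GAATTC", cut_offset=1):
    """Return the sorted fragment lengths produced by cutting seq at every
    occurrence of the recognition site (cut_offset bases into the site)."""
    seq = seq.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return sorted(b - a for a, b in zip(bounds, bounds[1:]))


def patterns_differ(sample, control, site="GAATTC", cut_offset=1):
    """True when sample and control yield different fragment-length patterns."""
    return digest(sample, site, cut_offset) != digest(control, site, cut_offset)


# Hypothetical sequences: a single substitution destroys the site in the sample.
control_dna = "AAAAGAATTCCCCCGGGG"
sample_dna = "AAAAGAGTTCCCCCGGGG"
print(digest(control_dna), digest(sample_dna))
print(patterns_differ(sample_dna, control_dna))
```

In this sketch a lost recognition site shows up as a single longer fragment in place of two shorter ones, which is the kind of difference that would be read from a gel.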

In other embodiments, genetic mutations in AS3 can be identified by hybridizing a
sample and control nucleic acids, e.g., DNA or RNA, to high density arrays containing
hundreds or thousands of oligonucleotide probes (Cronin, M.T. et al. (1996) Human
Mutation 7:244-255; Kozal, M.J. et al. (1996) Nature Medicine 2:753-759). For example,
genetic mutations in AS3 can be identified in two dimensional arrays containing
light-generated DNA probes as described in Cronin, M.T. et al., supra. Briefly, a first
hybridization array of probes can be used to scan through long stretches of DNA in a sample
and control to identify base changes between the sequences by making linear arrays of
sequential overlapping probes. This step allows the identification of point mutations. This
step is followed by a second hybridization array that allows the characterization of specific
mutations by using smaller, specialized probe arrays complementary to all variants or
mutations detected. Each mutation array is composed of parallel probe sets, one
complementary to the wild-type gene and the other complementary to the mutant gene.
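
The idea of scanning a sequence with sequential overlapping probes can be illustrated with a short Python sketch. It rests on simplifying assumptions that are not part of the disclosure: the sequences are hypothetical, a probe is treated as "hybridizing" only when its exact sequence occurs in the sample strand (strand complementarity and hybridization kinetics are ignored), and the final localization step assumes a single substitution.

```python
def tile_probes(reference, probe_len=5, step=1):
    """Return (start, probe) pairs of overlapping probes tiled along reference."""
    last_start = len(reference) - probe_len
    return [(i, reference[i:i + probe_len]) for i in range(0, last_start + 1, step)]


def failed_probes(reference, sample, probe_len=5, step=1):
    """Start positions of reference probes with no perfect match in sample."""
    return [pos for pos, probe in tile_probes(reference, probe_len, step)
            if probe not in sample]


def implicated_positions(failed, probe_len=5):
    """Positions covered by every failing probe (single-substitution assumption)."""
    if not failed:
        return []
    return list(range(max(failed), min(failed) + probe_len))


# Hypothetical 12-base reference and a sample with one substitution at index 6.
reference = "ACGTTGCAAGCT"
sample = "ACGTTGTAAGCT"
failed = failed_probes(reference, sample)
print(failed, implicated_positions(failed))
```

Every probe overlapping the altered base fails to match, and the overlap of the failing probes narrows the change to a single position, which a second, smaller array could then characterize.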

In yet another embodiment, any of a variety of sequencing reactions known in the art
can be used to directly sequence the AS3 gene and detect mutations by comparing the
sequence of the sample AS3 with the corresponding wild-type (control) sequence.
Examples of sequencing reactions include those based on techniques developed by Maxam
and Gilbert ((1977) Proc. Natl. Acad. Sci. USA 74:560) or Sanger ((1977) Proc. Natl. Acad.
Sci. USA 74:5463). It is also contemplated that any of a variety of automated sequencing
procedures can be utilized when performing the diagnostic assays ((1995) Biotechniques
19:448), including sequencing by mass spectrometry (see, e.g., PCT International
Publication No. WO 94/16101; Cohen et al. (1996) Adv. Chromatogr. 36:127-162; and
Griffin et al. (1993) Appl. Biochem. Biotechnol. 38:147-159).
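
As a minimal sketch of the comparison step, once sample and wild-type (control) sequences have been read, substitutions can be listed by position. The sequences below are hypothetical and the function handles only equal-length sequences; insertions and deletions would first require an alignment, which this sketch does not attempt.

```python
def point_differences(wild_type, sample):
    """Return (1-based position, wild-type base, sample base) for each mismatch."""
    if len(wild_type) != len(sample):
        raise ValueError("sequences differ in length; align them first")
    return [(i + 1, w, s)
            for i, (w, s) in enumerate(zip(wild_type.upper(), sample.upper()))
            if w != s]


# Hypothetical reads, not taken from SEQ ID NO:1 or 3.
print(point_differences("ATGGCC", "ATGACC"))
```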

Other methods for detecting mutations in the AS3 gene include methods in which
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or
RNA/DNA heteroduplexes (Myers et al. (1985) Science 230:1242). In general, the art
technique of "mismatch cleavage" starts by providing heteroduplexes formed by
hybridizing (labeled) RNA or DNA containing the wild-type AS3 sequence with potentially
mutant RNA or DNA obtained from a tissue sample. The double-stranded duplexes are
treated with an agent which cleaves single-stranded regions of the duplex such as those which
exist due to basepair mismatches between the control and sample strands. For instance,
RNA/DNA duplexes can be treated with RNase and DNA/DNA hybrids treated with S1
nuclease to enzymatically digest the mismatched regions. In other embodiments, either
DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or osmium tetroxide
and with piperidine in order to digest mismatched regions. After digestion of the
mismatched regions, the resulting material is then separated by size on denaturing
polyacrylamide gels to determine the site of mutation. See, for example, Cotton et al.
(1988) Proc. Natl. Acad. Sci. USA 85:4397; Saleeba et al. (1992) Methods Enzymol.
217:286-295. In a preferred embodiment, the control DNA or RNA can be labeled for detection.

In still another embodiment, the mismatch cleavage reaction employs one or more
proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations in
AS3 cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves
T at G/T mismatches (Hsu et al. (1994) Carcinogenesis 15:1657-1662). According to an
exemplary embodiment, a probe based on an AS3 sequence, e.g., a wild-type AS3
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is
treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be
detected from electrophoresis protocols or the like. See, for example, U.S. Patent No.
5,459,039.

In other embodiments, alterations in electrophoretic mobility will be used to identify
mutations in AS3 genes. For example, single strand conformation polymorphism (SSCP)
may be used to detect differences in electrophoretic mobility between mutant and wild type
nucleic acids (Orita et al. (1989) Proc. Natl. Acad. Sci. USA 86:2766; see also Cotton (1993)
Mutat. Res. 285:125-144; and Hayashi (1992) Genet. Anal. Tech. Appl. 9:73-79).
Single-stranded DNA fragments of sample and control AS3 nucleic acids will be denatured and
allowed to renature. The secondary structure of single-stranded nucleic acids varies
according to sequence; the resulting alteration in electrophoretic mobility enables the
detection of even a single base change. The DNA fragments may be labeled or detected
with labeled probes. The sensitivity of the assay may be enhanced by using RNA (rather
than DNA), in which the secondary structure is more sensitive to a change in sequence. In a
preferred embodiment, the subject method utilizes heteroduplex analysis to separate double
stranded heteroduplex molecules on the basis of changes in electrophoretic mobility (Keen
et al. (1991) Trends Genet. 7:5).

In yet another embodiment, the movement of mutant or wild-type fragments in
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient
gel electrophoresis (DGGE) (Myers et al. (1985) Nature 313:495). When DGGE is used as
the method of analysis, DNA will be modified to ensure that it does not completely denature,
for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich DNA
by PCR. In a further embodiment, a temperature gradient is used in place of a denaturing
gradient to identify differences in the mobility of control and sample DNA (Rosenbaum and
Reissner (1987) Biophys Chem 265:12753).

Examples of other techniques for detecting point mutations include, but are not
limited to, selective oligonucleotide hybridization, selective amplification, or selective
primer extension. For example, oligonucleotide primers may be prepared in which the
known mutation is placed centrally and then hybridized to target DNA under conditions
which permit hybridization only if a perfect match is found (Saiki et al. (1986) Nature
324:163; Saiki et al. (1989) Proc. Natl. Acad. Sci. USA 86:6230). Such allele specific
oligonucleotides are hybridized to PCR amplified target DNA or a number of different
mutations when the oligonucleotides are attached to the hybridizing membrane and
hybridized with labeled target DNA.
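
The placement of a known mutation at the center of an allele specific oligonucleotide can be sketched as follows. The sequence, the mutated position, and the flank length are hypothetical values chosen for illustration, and the function name is not taken from any library; practical probe design would also weigh melting temperature and secondary structure, which are ignored here.

```python
def allele_specific_oligos(wild_type, pos, alt, flank=8):
    """Return (wild-type oligo, mutant oligo), each 2*flank+1 bases long,
    with the base at 0-based index pos placed at the center."""
    if pos - flank < 0 or pos + flank >= len(wild_type):
        raise ValueError("not enough flanking sequence around the position")
    wt_oligo = wild_type[pos - flank:pos + flank + 1]
    mutant_oligo = wt_oligo[:flank] + alt + wt_oligo[flank + 1:]
    return wt_oligo, mutant_oligo


# Hypothetical target with a C-to-T change at index 10.
target = "ACGTACGTAGCTAGCTAGGATCC"
wt_oligo, mutant_oligo = allele_specific_oligos(target, pos=10, alt="T", flank=3)
print(wt_oligo, mutant_oligo)
```

Centering the variable base leaves equal flanking sequence on both sides, so a single mismatch is maximally destabilizing under conditions that permit hybridization only for a perfect match.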

Alternatively, allele specific amplification technology which depends on selective
PCR amplification may be used in conjunction with the instant invention. Oligonucleotides
used as primers for specific amplification may carry the mutation of interest in the center of
the molecule (so that amplification depends on differential hybridization) (Gibbs et al. (1989)
Nucleic Acids Res. 17:2437-2448) or at the extreme 3' end of one primer where, under
appropriate conditions, mismatch can prevent or reduce polymerase extension (Prossner
(1993) Tibtech 11:238). In addition it may be desirable to introduce a novel restriction site in
the region of the mutation to create cleavage-based detection (Gasparini et al. (1992) Mol.
Cell Probes 6:1). It is anticipated that in certain embodiments amplification may also be
performed using Taq ligase (Barany (1991) Proc. Natl. Acad. Sci. USA
88:189). In such cases, ligation will occur only if there is a perfect match at the 3' end of the
5' sequence, making it possible to detect the presence of a known mutation at a specific site by
looking for the presence or absence of amplification.

The methods described herein may be performed, for example, by utilizing
pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent
described herein, which may be conveniently used, e.g., in clinical settings to diagnose
patients exhibiting symptoms or family history of a disease or illness involving an AS3
gene.

Furthermore, any cell type or tissue in which AS3 is expressed, e.g., prostate tissue,
may be utilized in the prognostic assays described herein.

IX. C. 3. Monitoring of Effects During Clinical Trials

Monitoring the influence of agents (e.g., hormone therapy) on the expression or
activity of an AS3 protein (e.g., the modulation of cell proliferation) can be applied not only
in basic drug screening, but also in clinical trials. For example, the effectiveness of an agent
determined by a screening assay as described herein to increase AS3 gene expression,
protein levels, or upregulate AS3 activity, can be monitored in clinical trials of subjects
exhibiting decreased AS3 gene expression, protein levels, or downregulated AS3 activity.
Alternatively, the effectiveness of an agent determined by a screening assay to decrease AS3
gene expression, protein levels, or downregulate AS3 activity, can be monitored in clinical
trials of subjects exhibiting increased AS3 gene expression, protein levels, or upregulated
AS3 activity. In such clinical trials, the expression or activity of an AS3 gene, and
preferably, other genes (e.g., prostate-specific antigen (PSA)) that have been implicated in,
for example, an AS3-associated disorder can be used as a "read out" or markers of the
phenotype of a particular cell.

For example, and not by way of limitation, genes, including AS3, that are modulated
in cells by treatment with an agent (e.g., compound, drug or hormone) which modulates AS3
activity (e.g., identified in a screening assay as described herein) can be identified. Thus, to
study the effect of agents on cell proliferation modulated by AS3, for example, in a clinical
trial, cells can be isolated and RNA prepared and analyzed for the levels of expression of
AS3 and other genes implicated in the AS3-associated disorder. The levels of gene
expression (e.g., a gene expression pattern) can be quantified by northern blot analysis or
RT-PCR, as described herein, or alternatively by measuring the amount of protein produced,
by one of the methods as described herein, or by measuring the levels of activity of AS3 or
other genes. In this way, the gene expression pattern can serve as a marker, indicative of the
physiological response of the cells to the agent. Accordingly, this response state may be
determined before, and at various points during treatment of the individual with the agent or
may be used to determine when treatment is appropriate.

In a preferred embodiment, the present invention provides a method for monitoring
the effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist,
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate, e.g.,
a hormone, identified by the screening assays described herein) including the steps of (i)
obtaining a pre-administration sample from a subject prior to administration of the agent; (ii)
detecting the level of expression of an AS3 protein, mRNA, or genomic DNA in the
pre-administration sample; (iii) obtaining one or more post-administration samples from the
subject; (iv) detecting the level of expression or activity of the AS3 protein, mRNA, or
genomic DNA in the post-administration samples; (v) comparing the level of expression or
activity of the AS3 protein, mRNA, or genomic DNA in the pre-administration sample with
the AS3 protein, mRNA, or genomic DNA in the post-administration sample or samples;
and (vi) altering the administration of the agent to the subject accordingly. For example,
increased administration of the agent may be desirable to increase the expression or activity
of AS3 to higher levels than detected, i.e., to increase the effectiveness of the agent.
Alternatively, decreased administration of the agent may be desirable to decrease expression
or activity of AS3 to lower levels than detected, i.e., to decrease the effectiveness of the
agent. According to such an embodiment, AS3 expression or activity may be used as an
indicator of the effectiveness of an agent, even in the absence of an observable phenotypic
response.
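
The comparison in steps (v) and (vi) can be illustrated, without limitation, by the following Python sketch. The target fold change, the tolerance band, the expression values, and the function names are all hypothetical assumptions introduced for illustration; the sketch shows only the arithmetic of comparing pre- and post-administration levels and is not a clinical decision rule.

```python
from statistics import mean


def fold_change(pre_level, post_levels):
    """Mean post-administration level divided by the pre-administration level."""
    if pre_level <= 0:
        raise ValueError("pre-administration level must be positive")
    if not post_levels:
        raise ValueError("at least one post-administration level is required")
    return mean(post_levels) / pre_level


def suggest_adjustment(pre_level, post_levels, target_fold, tolerance=0.10):
    """Compare the observed fold change with a hypothetical target fold change."""
    observed = fold_change(pre_level, post_levels)
    if observed < target_fold * (1 - tolerance):
        return "increase"   # expression below the intended level
    if observed > target_fold * (1 + tolerance):
        return "decrease"   # expression above the intended level
    return "maintain"


# Hypothetical AS3 mRNA levels in arbitrary units.
print(suggest_adjustment(100, [110, 130], target_fold=2.0))
```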

IX. D. Methods of Treatment

The present invention provides for both prophylactic and therapeutic methods of
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated with
aberrant AS3 expression or activity. With regards to both prophylactic and therapeutic
methods of treatment, such treatments may be specifically tailored or modified, based on
knowledge obtained from the field of pharmacogenomics. "Pharmacogenomics", as used
herein, refers to the application of genomics technologies such as gene sequencing,
statistical genetics, and gene expression analysis to drugs in clinical development and on the
market. More specifically, the term refers to the study of how a patient's genes determine
his or her response to a drug (e.g., a patient's "drug response phenotype", or "drug response
genotype"). Thus, another aspect of the invention provides methods for tailoring an
individual's prophylactic or therapeutic treatment with either the AS3 molecules of the
present invention or AS3 modulators according to that individual's drug response genotype.
Pharmacogenomics allows a clinician or physician to target prophylactic or therapeutic
treatments to patients who will most benefit from the treatment and to avoid treatment of
patients who will experience toxic drug-related side effects.

IX. D. 1. Prophylactic Methods

In one aspect, the invention provides a method for preventing in a subject, a disease
or condition associated with an aberrant AS3 expression or activity, by administering to the
subject an AS3 molecule or an agent which modulates AS3 expression or at least AS3
activity. Subjects at risk for a disease (e.g., prostate cancer) which is caused or contributed
to by aberrant AS3 expression or activity can be identified by, for example, any or a
combination of diagnostic or prognostic assays as described herein. Administration of a
prophylactic agent can occur prior to the manifestation of symptoms characteristic of the
AS3 aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its
progression. Depending on the type of AS3 aberrancy, for example, an AS3 molecule, AS3
agonist (e.g., hormone therapy), or AS3 antagonist agent can be used for treating the subject.
The appropriate agent can be determined based on screening assays described herein.

IX. D. 2. Therapeutic Methods

Another aspect of the invention pertains to methods of modulating AS3 expression
or activity for therapeutic purposes. Accordingly, in an exemplary embodiment, the
modulatory method of the invention involves contacting a cell with an AS3 molecule or
agent that modulates AS3 protein activity associated with the cell (e.g., cell proliferation). An
agent that modulates AS3 protein activity can be an agent as described herein, such as a
nucleic acid or a protein, a naturally-occurring target molecule of an AS3 protein (e.g., an
AS3 substrate), an AS3 antibody, an AS3 agonist or antagonist, a peptidomimetic of an AS3
agonist or antagonist, or other small molecule (e.g., hormone, such as an androgen). In one
embodiment, the agent stimulates one or more AS3 activities. Examples of such stimulatory
agents include androgen therapy or a nucleic acid molecule encoding AS3 that has been
introduced into the cell. These modulatory methods can be performed in vitro (e.g., by
culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the agent to
a subject). As such, the present invention provides methods of treating an individual
afflicted with a cell proliferative disease (e.g., prostate cancer) or disorder characterized by
aberrant or reduced expression or activity of an AS3 protein or nucleic acid molecule. In
one embodiment, the method involves administering an agent (e.g., an agent identified by a
screening assay or a hormone such as androgen as described herein), or combination of
agents that modulates AS3 expression or activity.

Stimulation of AS3 activity is desirable in situations in which AS3 is abnormally
downregulated and/or in which increased AS3 activity is likely to have a beneficial effect.
For example, stimulation of AS3 activity is desirable in a cell proliferative disease such as
prostate cancer and increasing AS3 activity is likely to have a beneficial effect. Moreover,
the ability to detect androgen-induced AS3 expression in a patient is an indication that the
patient is responsive to hormone and therefore a candidate for intermittent hormone therapy.
As used herein, the term "intermittent hormone therapy" or "intermittent hormone
treatment" includes a treatment regime wherein a patient is treated with a hormone, such as
an androgen, for a period of time and then withdrawn from such treatment for a period of
time. This intermittent administration of hormone shall be, at least in part, determined by an
analysis of the patient's AS3 levels as described herein.

IX. D. 3. Pharmacogenomics

The AS3 molecules of the present invention, as well as agents, or modulators which
have a stimulatory or inhibitory effect on AS3 activity (e.g., AS3 gene expression) as
identified by a screening assay described herein can be administered to individuals to treat
(prophylactically or therapeutically) AS3-associated disorders (e.g., cell proliferation)
associated with aberrant or reduced AS3 activity. In conjunction with such treatment,
pharmacogenomics (i.e., the study of the relationship between an individual's genotype and
that individual's response to a foreign compound or drug) may be considered. Differences in
metabolism of therapeutics can lead to severe toxicity or therapeutic failure by altering the
relation between dose and blood concentration of the pharmacologically active drug. Thus,
a physician or clinician may consider applying knowledge obtained in relevant
pharmacogenomics studies in determining whether to administer an AS3 molecule or AS3
modulator as well as tailoring the dosage and/or therapeutic regimen of treatment with an
AS3 molecule or AS3 modulator.

Pharmacogenomics deals with clinically significant hereditary variations in the
response to drugs due to altered drug disposition and abnormal action in affected persons.
See, for example, Eichelbaum, M. et al. (1996) Clin. Exp. Pharmacol. Physiol.
23(10-11):983-985 and Linder, M.W. et al. (1997) Clin. Chem. 43(2):254-266. In general, two types
of pharmacogenetic conditions can be differentiated: genetic conditions transmitted as a
single factor altering the way drugs act on the body (altered drug action), and genetic
conditions transmitted as single factors altering the way the body acts on drugs (altered drug
metabolism). These pharmacogenetic conditions can occur either as rare genetic defects or
as naturally-occurring polymorphisms. For example, glucose-6-phosphate dehydrogenase
deficiency (G6PD) is a common inherited enzymopathy in which the main clinical
complication is haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides,
analgesics, nitrofurans) and consumption of fava beans.

One pharmacogenomics approach to identifying genes that predict drug response,
known as "a genome-wide association", relies primarily on a high-resolution map of the
human genome consisting of already known gene-related markers (e.g., a "bi-allelic" gene
marker map which consists of 60,000-100,000 polymorphic or variable sites on the human
genome, each of which has two variants). Such a high-resolution genetic map can be
compared to a map of the genome of each of a statistically significant number of patients
taking part in a Phase II/III drug trial to identify markers associated with a particular
observed drug response or side effect. Alternatively, such a high resolution map can be
generated from a combination of some ten-million known single nucleotide polymorphisms
(SNPs) in the human genome. As used herein, a "SNP" is a common alteration that occurs
in a single nucleotide base in a stretch of DNA. For example, a SNP may occur once per
every 1000 bases of DNA. A SNP may be involved in a disease process; however, the vast
majority may not be disease-associated. Given a genetic map based on the occurrence of
such SNPs, individuals can be grouped into genetic categories depending on a particular
pattern of SNPs in their individual genome. In such a manner, treatment regimens can be
tailored to groups of genetically similar individuals, taking into account traits that may be
common among such genetically similar individuals.
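
Grouping individuals by their pattern of SNPs can be sketched as follows. The marker names, genotype calls, and subject identifiers are hypothetical and introduced only for illustration; a real map would involve many thousands of markers and would typically cluster similar, rather than only identical, patterns.

```python
from collections import defaultdict


def group_by_snp_pattern(genotypes, markers):
    """Group subject identifiers by their tuple of calls at the given markers."""
    groups = defaultdict(list)
    for subject, calls in genotypes.items():
        pattern = tuple(calls[m] for m in markers)
        groups[pattern].append(subject)
    return dict(groups)


# Hypothetical bi-allelic calls at two hypothetical markers.
genotypes = {
    "subject_1": {"snp_a": "A/A", "snp_b": "C/T"},
    "subject_2": {"snp_a": "A/G", "snp_b": "C/C"},
    "subject_3": {"snp_a": "A/A", "snp_b": "C/T"},
}
groups = group_by_snp_pattern(genotypes, markers=["snp_a", "snp_b"])
print(groups)
```

Each resulting group collects the individuals sharing one pattern, and observed drug responses could then be summarized per group.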

Alternatively, a method termed the "candidate gene approach" can be utilized to
identify genes that predict drug response. According to this method, if a gene that encodes a
drug's target is known (e.g., an AS3 protein of the present invention), all common variants of
that gene can be fairly easily identified in the population and it can be determined if having
one version of the gene versus another is associated with a particular drug response.
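
Whether one version of a gene is associated with a drug response can be examined, for instance, with a two-by-two table of variant carriers against responders. The sketch below is one possible illustration, not a method prescribed herein: it computes an odds ratio and a one-sided Fisher exact p-value using only the Python standard library, and the counts are hypothetical.

```python
from math import comb


def odds_ratio(a, b, c, d):
    """Odds ratio for the table [[a, b], [c, d]]:
    a = variant responders, b = variant non-responders,
    c = other-allele responders, d = other-allele non-responders."""
    if b == 0 or c == 0:
        raise ValueError("odds ratio is undefined when b or c is zero")
    return (a * d) / (b * c)


def fisher_one_sided(a, b, c, d):
    """P(at least a variant responders) under the hypergeometric null
    with the row and column totals held fixed."""
    row1, col1, n = a + b, a + c, a + b + c + d
    upper = min(row1, col1)
    tail = sum(comb(row1, k) * comb(n - row1, col1 - k)
               for k in range(a, upper + 1))
    return tail / comb(n, col1)


# Hypothetical counts: 8 of 10 variant carriers respond, 1 of 10 others respond.
print(odds_ratio(8, 2, 1, 9), fisher_one_sided(8, 2, 1, 9))
```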

As an illustrative embodiment, the activity of drug metabolizing enzymes is a major
determinant of both the intensity and duration of drug action. The discovery of genetic
polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) and
cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation as to why
some patients do not obtain the expected drug effects or show exaggerated drug response
and serious toxicity after taking the standard and safe dose of a drug. These polymorphisms
are expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor
metabolizer (PM). The prevalence of PM is different among different populations. For
example, the gene coding for CYP2D6 is highly polymorphic and several mutations have
been identified in PM, which all lead to the absence of functional CYP2D6. Poor
metabolizers of CYP2D6 and CYP2C19 quite frequently experience exaggerated drug
response and side effects when they receive standard doses. If a metabolite is the active
therapeutic moiety, PM show no therapeutic response, as demonstrated for the analgesic
effect of codeine mediated by its CYP2D6-formed metabolite morphine. At the other extreme
are the so-called ultra-rapid metabolizers who do not respond to standard doses. Recently,
the molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene
amplification.

Alternatively, a method termed "gene expression profiling" can be utilized to
identify genes that predict drug response. For example, the gene expression of an animal
dosed with a drug (e.g., an AS3 molecule or AS3 modulator of the present invention) can 
give an indication whether gene pathways related to toxicity have been turned on. 

Information generated from more than one of the above pharmacogenomics
approaches can be used to determine appropriate dosage and treatment regimens for
prophylactic or therapeutic treatment of an individual. This knowledge, when applied to
dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus enhance
therapeutic or prophylactic efficiency when treating a subject with an AS3 molecule or AS3 
modulator, such as a modulator identified by one of the exemplary screening assays 
described herein. 

This invention is further illustrated by the following examples which should not be
construed as limiting. The contents of all references, patents and published patent
applications cited throughout this application are incorporated herein by reference.

EXAMPLE 1.
ISOLATION AND CLONING OF HUMAN AS3 
In this example, the isolation and cloning of the gene encoding human AS3 is 
described. 

To isolate a cDNA encoding an inhibitor of prostate cancer progression, the
LNCaP-FGC cell line established from a metastatic lymph node from a patient with prostate
adenocarcinoma (Horoszewicz et al., (1983) Cancer Res. 43:1809-1818) was utilized. This
cell line and related cell lines derived therefrom were cultured as previously described (Soto
et al., (1995) Oncology Res. 7:545-558; Soto et al., (1991) J. Steroid Biochem. 23:87-94).

In addition, to isolate a gene expressed at the proliferative shutoff point (i.e., Step 2), 
a subtractive strategy was used whereby the proliferation of LNCaP-FGC cells was arrested 
using two different treatments, namely, CD serum (i.e., androgen free serum) and high 
androgen concentrations. This selective approach takes advantage of the fact that the cells 
were equally arrested at the G1 stage of the cell cycle by different mechanisms (Soto et al.,
(1995) Oncology Res. 7:545-558).

Moreover, since regulatory mRNAs are frequently expressed at low copy numbers, a 
protocol was adopted using repeated PCR cycles to selectively amplify these sequences. 
The final subtracted pool, therefore, was enriched to represent high ranking regulatory 
elements in the androgen-induced proliferative shutoff (Step-2). 

Briefly, androgen-specific, low-abundance regulatory mRNA sequences expressed
during the proliferative shutoff were selected using the Wang-Brown approach (Wang et
al., (1991) Proc. Natl. Acad. Sci. USA 88:11505-11509). Short fragments of cDNAs were
amplified first; then three cycles of subtractions and amplifications between the control and
proliferation-arrested cDNAs resulted in sequence pools that were differentially expressed
(Geck et al., (1997) J. Steroid Biochem. Mol. Biol. 63:211-218). LNCaP-FGC cells were
treated with 30 nM R1881 to generate proliferative shutoff. R1881 (methyltrienolone) is a
synthetic, non-metabolized androgen (Roussel-UCLAF, Romainville, France). Exposure to
androgen for 24 hours was required to commit LNCaP-FGC cells to an irreversible
proliferative shutoff (Geck et al., (1997) J. Steroid Biochem. Mol. Biol. 63:211-218). It was
concluded that at this point, the genes responsible for the shutoff were highly induced.
LNCaP-FGC cells reversibly arrested by CDHuS were considered as the shutoff-negative
control; they were harvested after three days of CDHuS treatment. Total RNA was prepared
by the acidic guanidinium-thiocyanate method and polyA+ RNA was purified by using the
FastTrack kit (Invitrogen, San Diego, CA) (Chomczynski et al., (1987) Anal. Biochem.
162:156-159).

Double-stranded cDNA pools from R1881-treated cells (R cDNA) and
CDHuS-treated cells (CD cDNA) were synthesized using the Copy Kit (Invitrogen), with
oligo-dT priming. After AluI and RsaI digestions and adaptor ligations, the constructs were
PCR-amplified (GeneAmp Kit, Perkin Elmer, Foster City, CA). The amplified CD cDNAs were
digested with EcoRI, photobiotinylated (driver cDNA) and hybridized at 20-fold molar
excess to an aliquot of non-biotinylated R cDNA. The hybridized non-specific sequences
were eliminated by subsequent streptavidin chromatography. After 3 cycles of selection,
the amplified expressed sequence tag (EST) pool of the androgen-induced shutoff AS (R 
cDNA pool minus CD cDNA pool) sequences was digested with EcoRI, cloned into the 
BlueScript SK vector (Stratagene, La Jolla, CA) and transformed into E. coli (OneShot 
strain, Invitrogen). 

Isolation of unique cDNAs from the differentially expressed sequence pool was
performed as follows. Recombinants were collected randomly from the shutoff-positive AS
pool of the Wang-Brown differential library and were plated. Using the labeled CD- and
R-subtracted (CD cDNA pool minus R cDNA pool), PCR-amplified DNA master mixes as
probes, double hybridizations revealed 11 and 14 clones that were present exclusively in the
CD and R clone sets, respectively (Geck et al., (1997) J. Steroid Biochem. Mol. Biol.
63:211-218). Multiple cross-hybridizations identified ten unique inserts.

To sequence the identified EST fragments, PCR sequencing reactions were
performed using the dsDNA Sequencing System (Life Technologies, Gaithersburg, MD).
The EST DNA sequences were tested for homology to known DNA sequences using the
FASTA and BLAST (National Center for Biotechnology Information, Bethesda, MD)
programs. Five inserts were found with no match in GenBank (Geck et al., (1997) J. Steroid
Biochem. Mol. Biol. 63:211-218). For further analysis, the mRNA with the highest
induction in shutoff-positive LNCaP-FGC cells (AS3, >5-6-fold of the 5.3 kb mRNA, and
>3-4-fold of the 8 kb isoform) was selected.

To isolate the full length AS3 cDNA sequence, a 262 bp AS3 tag sequence was
utilized to design nested primer pairs to amplify the full length cDNA sequence from a
cDNA library. The cDNA libraries were generated by Human Genome Sciences (Rockville,
MD), using polyA+ mRNA preparations from androgen-treated or CDHuS-treated
proliferation-arrested LNCaP-FGC cells. The Lambda ZAPII (UniZAP) phage was used as
vector carrying EcoRI and XhoI cloning sites. The PCR reaction was designed to amplify
the cloned unknown cDNA segments between the known tag sequence and the flanking
vector sequences. Since the orientation of the tag sequence was not known, both ends of the
insert were amplified in both directions. The vector primers used were commercially
available sequencing primers: M13 Reverse and T3 primers at the EcoRI site, and M13 -20
and T7 primers at the XhoI site.

For the PCR reaction, the Expand High Fidelity kit (Boehringer-Mannheim) was used
with 1 µl of phage suspension as template. A 40 cycle amplification in a Perkin-Elmer
9600 thermocycler resulted in the production of a 1370 bp 5' fragment and a 3250 bp
3' fragment. These PCR products were purified using Qiagen columns, and sequenced by
automatic sequencing using a primer walking strategy. The sequencing data showed that the
open reading frame in the 5' end fragment did not have an authentic AUG codon.

To search for the missing 5' end of the transcript, the Prostate Specific Marathon
Ready cDNA preparation from Clontech was used. Amplifications with the Clontech
anchored primer and a set of AS3 specific primers resulted in a 419 bp fragment. The DNA
was cloned and sequencing data showed that it carried the N-terminal 118 amino acids of the
open reading frame. The nucleotide sequence reported herein has been submitted to
GenBank under the accession number U95825 (see also, Geck et al., (1999) J. Steroid
Biochem. Mol. Biol. 68:41-50).

EXAMPLE 2. 

CHARACTERIZATION OF THE AS3 cDNA SEQUENCE 
In this example, the features of the AS3 mRNA and cDNA sequences are described.

Computer analysis of the sequenced 5253 bp AS3 cDNA identified a long open
reading frame (Figure 1). The initiator methionine is at position 66, the stop codon is
at position 4239, and the region codes for a polypeptide of 1391 residues. The
initiator is the first AUG codon downstream from the 5' end of the sequence, and appears in
a strong Kozak context (Kozak, (1991) J. Biol. Chem. 266:19867-19870; Kozak, (1991) J.
Cell Biol. 115:887-903). The Northern blot size of the transcript is between 5.3 and 5.5 kb
(Geck et al., (1997) J. Steroid Biochem. Mol. Biol. 63:211-218) and since the sequence
reported here has 5253 nucleotides plus the poly-A tail, the 5' end of our sequence is at or
within a few nucleotides of the 5' physical end, further suggesting that the initiator is
authentic. The 5' non-coding region is high in GC nucleotides (63.3%), but it has no
recognizable secondary structure elements or other sequence features. The 3' non-coding
region has several destabilizing AT-rich elements (underlined in Figure 1), typical of
transcripts claimed to play a role in cell proliferation (Shaw et al., (1986) Cell 46:659-667;
Chen et al., (1995) Trends Biochem. Sci. 20:465-470). The polyadenylation signal of the
transcript is 25 bp upstream of the consensus GT-rich cleavage site (indicated in a larger
font in Figure 1).
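
The stated coordinates can be cross-checked with simple arithmetic. The Python sketch below assumes that positions 66 and 4239 denote the first base of the initiator codon and of the stop codon, respectively, in 1-based cDNA coordinates. That reading is an assumption, although it is the one under which the stated polypeptide length of 1391 residues is reproduced.

```python
def orf_codon_count(start_pos, stop_pos):
    """Count sense codons from the first base of the initiator codon up to,
    but not including, the stop codon (1-based positions, assumed to mark
    the first base of each codon)."""
    span = stop_pos - start_pos
    if span <= 0 or span % 3 != 0:
        raise ValueError("start and stop codons are not in the same frame")
    return span // 3


CDNA_LENGTH = 5253       # length of the sequenced cDNA, from the text
START, STOP = 66, 4239   # positions given in the text

codons = orf_codon_count(START, STOP)        # 1391 residues
five_prime_utr = START - 1                   # 65 nt upstream of the initiator
three_prime_utr = CDNA_LENGTH - (STOP + 2)   # 1012 nt after the stop codon
```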

EXAMPLE 3. 

CHARACTERIZATION OF THE AS3 POLYPEPTIDE SEQUENCE 
In this example, various structural and functional features of the AS3 polypeptide are 
described. 

Computer analysis of the AS3 open reading frame was performed using the Translate
program of the Wisconsin Package Version 9.0, Genetics Computer Group (GCG),
Madison, Wisconsin. β-strand and α-helix structures were calculated by the Chou-Fasman
method using the PepStructure and PepPlot programs. Motif and profile predictions were
calculated using various programs of the Wisconsin Package, or by using remote servers
offering sequence analyses of protein functional domains through the Internet. The
following remote servers were used: PROWEB (http://www.proweb.org); BLOCKS
(http://www.blocks.fhcrc.org); PRODOM (http://www.toulouse.inra.fr/prodom/); PRINTS
(http://www.biochem.ucl.ac.uk/cgi-bin/attwood/) and the Protein Kinase Resource
(http://www.sdsc.edu/Kinases/).

Employing the above programs, the expected molecular weight of the AS3
polypeptide was determined to be 186 kD. In addition, it is noted that the N-terminal 400
amino acid domain is characterized by a unique arrangement of 31 aliphatic residues (21 of
them are leucines). Every seventh position (with minor variations) is occupied by a leucine or
a similar hydrophobic residue and, in the five subdomains shown in Figure 2, the pattern is
uninterrupted. The arrangement is typical for coiled-coil structures where one side of the
long α-helices is hydrophobic and usually participates in protein-protein interactions (Lupas,
(1996) Trends Biochem. Sci. 22:375-382; Beavil et al., (1992) Proc. Natl. Acad. Sci. USA
89:753-757). The leucine-zipper motif of DNA binding proteins is a specific subclass of
this general pattern and the subdomain between positions 196 and 217 in the AS3 sequence
is a perfect leucine-zipper.
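
A heptad leucine repeat of this kind can be located with a simple pattern search. The Python sketch below is not the Wisconsin Package method used above. It assumes the classical definition of a leucine zipper as four leucines spaced exactly seven residues apart (L-x6-L-x6-L-x6-L), which spans 22 residues, the same length as the 196-217 subdomain. The toy sequence is hypothetical and is not the AS3 sequence.

```python
import re

# A lookahead is used so that overlapping 22-residue windows are all reported.
LEUCINE_ZIPPER = re.compile(r"(?=(L.{6}L.{6}L.{6}L))")


def find_leucine_zippers(protein):
    """Return 1-based inclusive (start, end) positions of every
    L-x6-L-x6-L-x6-L window in a one-letter-code protein sequence."""
    return [(m.start() + 1, m.start() + 22)
            for m in LEUCINE_ZIPPER.finditer(protein)]


# Hypothetical toy sequence with leucines at positions 3, 10, 17 and 24.
toy = "MA" + "LAAAAAA" * 3 + "L" + "GG"
hits = find_leucine_zippers(toy)
```

The strict pattern does not capture the "minor variations" mentioned in the text; a profile-based method would be needed for that.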

The AS3 polypeptide sequence between positions 400 and 600 has elements of a
conserved Mg-ATP binding domain of various nucleotide triphosphate binding proteins
including protein kinases. In Figure 3, the AS3 sequence is shown in the conserved
subdomain arrangements established by Hanks (Hanks et al., (1991) Methods Enzymol.
200:38-62). The conserved β-strand, α-helix structures and highly conserved critical
residues are also indicated, together with the corresponding sequences of various protein
kinases (Taylor et al., (1992) Annu. Rev. Cell Biol. 8:429-462). Although the complete
AS3 sequence did not appear to be related to any particular protein kinase or ATP binding
protein, partial homology within the subdomains was maintained, and probably indicates
that the domain is functional. Indeed, one feature of the AS3 polypeptide is the presence of
several relatively well conserved kinase-related domains such as an Mg-nucleotide
triphosphate binding pocket, as well as elements of the catalytic domain of several protein
kinases (see Figure 3). Functional analysis of this domain using GST-fusion constructs
indicates that the fusion construct polypeptide can form a complex resulting in the
phosphorylation of at least two substrate proteins found in LNCaP-FGC cell extracts.

In addition to the above-mentioned motifs, the AS3 polypeptide also has a functional
protein kinase domain located at amino acid residues 474-680 (encoded by nucleic acids
1420-2040; see SEQ ID NO: 3). To demonstrate that this region of the AS3 polypeptide has
kinase activity, and is, e.g., capable of phosphorylating cellular substrates, the nucleic acid
encoding this region was PCR amplified (using a corresponding upstream primer with a BamHI
site and a corresponding downstream primer with an EcoRI site), digested, purified, and
cloned into BamHI sites of the bacterial GST-fusion vector pGEX-T2 thereby fusing the AS3
domain to the C-terminus of GST (thus, the fusion protein is referred to as GST-AS3). The
resultant construct, encoding a GST-AS3 fusion protein, was then transferred into bacteria
(BL21 protease minus host) which were then induced to express the GST-AS3 fusion protein.
Bacterial extracts containing the GST-AS3 fusion protein were incubated with MCF7-AR1 cell
extracts in the presence of 32P-ATP and chromatographed through glutathione-Sepharose.
Proteins which specifically bound to the GST-AS3 fusion protein were resolved by SDS-PAGE
and autoradiography using standard conditions. Two polypeptides (40 and 120 kDa)
were specifically purified and phosphorylated by the AS3-GST fusion protein demonstrating
the functional kinase activity of this protein kinase domain of AS3. There was no 32P-labelled
band observed at 90 kDa (the mass of the AS3-GST fusion protein) suggesting that the AS3
kinase domain does not auto-phosphorylate but phosphorylates other proteins through, e.g., a
docking mechanism.
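
The correspondence between residues 474-680 and nucleic acids 1420-2040 can be reproduced with a short calculation. The Python sketch below assumes that the nucleotide numbering starts at the first base of the initiator codon (coding-sequence coordinates); this is an inference from the numbers themselves, not something stated in the text. The optional `offset` parameter is a hypothetical convenience for shifting into other coordinate systems.

```python
def residues_to_cds_range(first_residue, last_residue, offset=0):
    """Map a 1-based inclusive residue range to the 1-based inclusive
    nucleotide range that encodes it. With offset=0 the numbering starts
    at the first base of the initiator codon."""
    first_nt = 3 * (first_residue - 1) + 1 + offset
    last_nt = 3 * last_residue + offset
    return first_nt, last_nt


kinase_cds = residues_to_cds_range(474, 680)        # (1420, 2040)
amplicon_length = kinase_cds[1] - kinase_cds[0] + 1  # 621 nt, i.e. 207 codons
```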

Furthermore, a putative nuclear localization sequence (NLS)
(KKFTQVLEDDE KIRK) resembling that of the androgen receptor and DNA polymerase-α
was localized at position 547 (Zhou et al., (1994) J. Biol. Chem. 269:13115-13123;
Bouvier et al., (1995) Mol. Biol. Cell 6:1697-1705). Further, the C-terminal region of the
putative AS3 polypeptide contains several sequence elements that show similarities to DNA
binding proteins. Motifs and ProfileScan searches in the Wisconsin Package indicated
helix-loop-helix and Homeo-box signature sequences in the area, and a remote search on the
BLOCKS server also identified DNA binding block elements in the C-terminal sequences.
Still further, a serine-rich domain at position 1139 and a proline/glycine-rich
domain at position 1284 were also found. The C-terminal domain (about 200 amino
acids) is highly charged and arranged in unique repeats of seven alternating acidic and basic
domains.

A BLASTP search performed on the GenBank database resulted in a single high
score similarity with the bimD gene of the eukaryotic organism Aspergillus nidulans, where
50% of the amino acid sequence was functionally similar in portions of the coiled-coil
domain and the putative DNA binding domain at the C-terminus. The bimD protein has a
basic leucine-zipper and a C-terminal charged (acidic) domain, similar to AS3, and appears
to function as a DNA binding protein (Denison et al., (1992) Genetics, 134:1085-1096).
Both the AS3 and the bimD proteins also have nuclear localization consensus sequences. 

Finally, to confirm that the AS3 polypeptide is capable of binding to DNA and,
moreover, to elucidate a DNA motif that binds to the DNA recognition sequence of AS3, an
AS3-GST fusion polypeptide containing the DNA recognition sequence of AS3 was produced 
using a similar approach as used above to assay AS3 kinase activity. 

Briefly, an expression vector was engineered to encode the DNA binding domain of
AS3 fused to the N-terminus of GST (thus, the fusion protein is referred to as AS3-GST)
which, when expressed in bacteria, allowed for the production of an AS3-GST fusion protein
for DNA binding studies. The AS3-GST protein was then purified by affinity chromatography
as described above. Next, the resultant, purified AS3-GST protein was incubated with a
mixture of oligonucleotides comprising constant 5' (25 bp) and 3' (25 bp) ends and randomly
generated middle segments (18 bp) and chromatographed through a glutathione-Sepharose
column. To avoid non-specific binding, double stranded poly(dI-dC) (average length 1800 bp)
was added to the mixture. The AS3-GST fusion protein, together with bound
oligonucleotides, was eluted with glutathione and the bound oligonucleotides were amplified
by PCR, purified, and subjected to a second round of column chromatography using AS3-GST
before being PCR amplified and cloned into a TA vector.
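
To give a sense of scale for this selection, the size of the oligonucleotide and the theoretical complexity of the random segment follow directly from the stated lengths. The short Python calculation below assumes that all four bases are allowed at each of the 18 random positions, which the text implies but does not state explicitly.

```python
FLANK_5, RANDOM_CORE, FLANK_3 = 25, 18, 25   # lengths in bp, from the text

# Total length of each oligonucleotide in the mixture.
oligo_length = FLANK_5 + RANDOM_CORE + FLANK_3

# Number of distinct random middle segments if every position may be A, C, G or T.
distinct_cores = 4 ** RANDOM_CORE
```

Under that assumption each oligonucleotide is 68 bp long and the random segment has roughly 6.9 x 10^10 possible sequences.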

Thirty clones were sequenced to assess whether a specific nucleotide sequence was
recognized by the AS3-GST fusion product. The sequences were compared and analyzed by
the PILE-UP program of the GCG package. The co-alignment of the sequences revealed a
pattern of the following putative consensus sequence:

5' C, T, A, [T/A], [T/A], A, G, [C/G], C, C, C, [C/G], G, C, [C/G], C, A, [A/T] 3' (SEQ ID
NO: 5)
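
The degenerate consensus above can be turned into a searchable pattern. The Python sketch below is an illustration only: it encodes each of the 18 positions as the set of bases allowed there, builds a regular expression, and enumerates the concrete sequences the consensus permits. It scans the given strand only; scanning the reverse complement as well would be a natural extension but is not shown.

```python
import re
from itertools import product

# One entry per position of the putative consensus; two letters mean either base.
CONSENSUS = ["C", "T", "A", "TA", "TA", "A", "G", "CG", "C",
             "C", "C", "CG", "G", "C", "CG", "C", "A", "AT"]


def consensus_regex(positions):
    """Build a regular expression with one character class per degenerate position."""
    return re.compile("".join(p if len(p) == 1 else f"[{p}]" for p in positions))


def expand(positions):
    """Enumerate every concrete sequence allowed by the consensus."""
    return ["".join(bases) for bases in product(*positions)]


pattern = consensus_regex(CONSENSUS)
variants = expand(CONSENSUS)   # six two-base positions give 2**6 = 64 sequences
```

A promoter sequence could then be scanned with `pattern.finditer(sequence)`, although exact matching is stricter than the similarity comparisons described in the text.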

Interestingly, sequence similarities were found between the putative AS3 DNA
recognition sequence and the NF-kappaB and the Mbp1 recognition motifs. In addition,
similarities within the p27Kip1 promoter at positions -270, -383, -427, and -586 were also
found, indicating that important gene regulatory elements exist in the cell with which the AS3
protein may interact (Zhang et al., (1997) Biochim. Biophys. Acta, 1353:307-317). Interestingly,
the transcription factor NF-kappaB, which is involved in the control of apoptosis, and Mbp1,
which is involved in the G1-S cell cycle transition in yeast, also interact with similar DNA
motifs (Waddick et al., (1999) Biochem. Pharmacol., 57:9-17; Koch et al., (1993) Science,
261:1551-1557).

Accordingly, it was concluded that AS3 is capable of binding to DNA in a
sequence-specific manner and, therefore, may function as a transcriptional regulator of genes
involved in cell growth control.

EXAMPLE 4.

THE AS3 GENOMIC LOCUS AND USE OF AS3 RELATED MOLECULES AS 

MARKERS FOR DISEASE 
In this example, aspects of the AS3 genomic locus and the use of AS3 related 
molecules as markers for disease are described. 

A computer homology search in GenBank was performed and identified the AS3
genomic region as residing on chromosome 13q12-q13. This area is represented by cosmid
267pl9, and on P1 artificial chromosomes PAC26H23 and PAC49J10. Consensus splicing
donor and acceptor sites were identified and the entire exon-intron structure of the AS3 gene
was resolved by comparing the cDNA sequence and the genomic sequence using the BLAST
program (see Figure 4) (Shapiro et al., (1987) Nucl. Acids Res. 15:7155-7174). The actual
cosmid and cDNA positions are listed in Figure 4, and the arrangement of exons is depicted 
in Figure 5. The area covers nearly 200,000 bp and the average size of the exons is 100-150 
bp. 

Interestingly, the AS3 genomic area is centromeric to the RB1 locus, and telomeric
to BRCA2. The AS3 gene is transcribed in the same direction as BRCA2, and the coding
strand is downstream from the breast cancer gene (Couch et al., (1997) Genomics
36:86-99). On the opposite strand upstream of AS3, three regions were assigned to cDNAs
of unknown functions. An expressed sequence (CG008) has been assigned to this area, and
represents a portion of the AS3 transcript (Couch et al., (1997) Genomics 36:86-99). The
N-terminal 354 amino acids of the open reading frame are missing in the CG008 sequence in
GenBank. The CG008 open reading frame terminates at amino acid 738 of the AS3 
sequence. The sequencing data reported herein and the published genomic sequence are 
identical, confirming the correct sequence of AS3. The extra C at nucleotide position 1,109 
in the CG008 sequence suggests a possible sequencing error that results in a frame shift and 
a stop codon at position 1,152 of the CG008 sequence. 

With the above information in hand, it was observed that several
epidemiological studies support a link between breast and prostate cancers, implying shared
genetic suppressor elements in both disease states (Thiessen, (1974) Cancer 34:1102-1107;
Tulinius et al., (1992) Br. Med. J. 305:855-857). For example, studies of breast cancer
families with high loss of heterozygosity (LOH) in the BRCA2 area showed that high
prostate cancer incidence also occurred in 4 out of 5 families examined (Gudmundsson et
al., (1995) Cancer Res. 55:4830-4832). Further, in the majority of the male relatives with
prostate cancer in these families (86%), allelic losses in the BRCA2 area were also detected
with some of these mutations occurring in the region immediately downstream of the
BRCA2 gene (Gudmundsson et al., (1995) Cancer Res. 55:4830-4832; Van den Berg et al.,
(1996) Br. J. Cancer 74:1615-1619; Cleton-Jansen et al., (1995) Br. J. Cancer
72:1241-1244).

Moreover, it was noted that putative suppressors in the immediate vicinity of
BRCA2 are not limited to sex hormone-related cancers. For example, recent studies on
chronic lymphoid leukemia detected a 1 Mb allelic loss in this region, with no mutations in
the BRCA2 gene, pointing to a cryptic suppressor next to this gene (Garcia-Marco et al.,
(1996) Blood 88:1568-1575; Caldas et al., (1997) Proc. Am. Assoc. Cancer Res. 38:191).
As indicated herein, the coding sequence of AS3 lies within this area. Thus, AS3 (or
AS3-related molecules) may be associated with a number of diseases and conditions at several
different levels involving, e.g., genomic alterations (including deletions and/or mutations at
the chromosomal level), altered transcription or transcript production, and/or altered protein
or protein expression levels. Thus, "AS3 related molecules" include nucleic acid fragments
or probes derived from the genomic locus (or an AS3 cDNA), AS3 or AS3 related proteins
or protein fragments.

In addition to the foregoing, the ability to determine AS3 protein expression levels,
altered AS3 proteins, and/or AS3 protein expression patterns in, for example, different tissue
samples (such as a biopsy sample from, e.g., the prostate) may be desired. To this end, several
antibodies have been developed that specifically bind the AS3 polypeptide. Briefly, computer
aided sequence analysis of the AS3 protein was performed in order to identify several antigenic
areas of the AS3 protein that were suitable for use as an immunogen. Accordingly, oligopeptides
corresponding to amino acid residues 711-727 and 1369-1387 of SEQ ID NO: 4 were
synthesized and used to raise antibodies in, respectively, rabbits and chickens (egg yolk
immunoglobulin Y). The resultant antibodies were both tested against cellular extracts derived
from either LNCaP-FGC or MCF7-AR1 cells induced with androgen to express AS3 protein
and both antibody preparations recognized a protein band of the expected size for AS3 protein.
Thus, these antibodies can be used for performing, for example, immunocytochemistry on
various cell samples (e.g., biopsies of the prostate) for various prognostic and diagnostic
determinations.

Accordingly, it was concluded that the AS3-related molecules and AS3-specific 
antibodies disclosed herein are useful prognostic/diagnostic probes for evaluating diseases 
(e.g., the aforementioned cancers) that are associated with, or map to, the AS3 locus or are 
associated with altered AS3 protein expression. 

EXAMPLE 5. 

USE OF AS3 MOLECULES IN THE TREATMENT OF PROSTATE CANCER 
In this example, the use of AS3 as a marker in guiding the appropriate administration 
of hormone therapy for prostate cancer is discussed. 

Expression analysis of the AS3 transcript demonstrated
proliferation arrest-specific induction patterns, starting soon (4-6 h) after androgen exposure
(Geck et al., (1997) J. Steroid Biochem. Mol. Biol. 63:211-218). AS3 levels peaked at
18-20 h, about 4 h before the commitment for proliferative shutoff was detected, suggesting
that this gene is a candidate for a shutoff mediator. Furthermore, expression of the AS3
transcript positively correlated with proliferation arrest as this gene was expressed only in
shutoff-positive cell lines and variants. In addition, LNCaP-FGC cells proliferated
maximally in CDHuS supplemented with 30 pM R1881 and under these conditions AS3 was
not expressed. When AS3 was strongly induced in the presence of hormone (i.e., 0.3-30
nM of R1881), the cells were inhibited from proliferating. An additional observation that
indicates the AS3 gene codes for an inhibitor of the proliferation of prostate cells is the
increase of AS3 mRNA levels in the rat prostate when proliferation was arrested by
prolonged androgen administration. In addition, comparable effects in MCF7-AR1 cells
have been observed. Finally, significant homology between the fungal gene bimD and AS3
was found. Importantly, it was noted that overexpression of bimD in Aspergillus nidulans
results in a cell cycle arrest in G1/S phase, and a similar cell cycle arrest and
concordant peak expression of AS3 were observed in mammalian cells induced to undergo
androgen-induced proliferative shutoff (Denison et al., (1992) Genetics, 134:1085-1096; Geck
et al., (1997) J. Steroid Biochem. Mol. Biol. 63:211-218).

Based on these observations, it was concluded that AS3 expression is a useful
marker of responsiveness of prostate cancer cells to the inhibitory effect of androgens. In
order to make the tumor regress, avoid recurrence, and maintain an acceptable quality of
life, patients presenting this response may be treated with alternate cycles of antiandrogens
and androgens. Androgens have a biphasic effect on the normal prostate: an initial phase of
increased proliferation (Step 1) followed by a phase of inhibition (Step 2).

Prior to the invention, prostate cancer has been treated by hormone ablation
(castration, antiandrogens) to take advantage of Step 1. Usually after a significant
regression, the remaining tumor cells become resistant and the tumor and/or metastases
relapse. The invention avoids this problem by allowing the clinician to subject patients to an
intermittent therapy. This is based on a classification of patients who have an increased
chance to be responsive to the intermittent hormone therapy. Patients not showing
AS3-positive cancer cells would then be subjected to alternative therapies (chemotherapy,
radiation, etc.). In contrast, the protocol for AS3-positive patients would be to first
administer antiandrogens, to block the proliferative effect and return PSA levels to normal,
and then to treat the patient with physiological but high doses of androgen to elicit Step 2.
The invention allows the clinician to confidently assess the initial hormone dependence of
the tumor (i.e., whether tumor cells express AS3 in response to hormone) and determine
when to re-expose the patient to hormone. Importantly, the use of intermittent hormone
therapy allows the clinician to determine exactly when cells are no longer responding to
hormone such that hormone treatment can be withdrawn before the cells become refractory
to hormone treatment and untreatable. If the patient develops metastases capable of being
biopsied, assaying AS3 levels as described herein would allow a renewed attempt to reduce
the proliferation of those cancer cells by increasing the androgen concentration of the
treatment. This important aspect of the invention should allow the clinician to lower the rate
at which clones of cells develop hormone resistance and become capable of multiplying and
metastasizing without androgen stimulation. Moreover, this new treatment regime affords a
better quality of life for the patient because, unlike with constant hormone treatment, sexual
drive and potency are recovered.

EXAMPLE 6

ANTISENSE ASSAY FOR DEMONSTRATING THAT AS3 IS AN ANDROGEN- 
INDUCED SUPPRESSOR OF CELL PROLIFERATION 
In this example, an inducible antisense assay is provided for demonstrating that AS3
is an androgen-inducible suppressor of cell proliferation.

To demonstrate androgen-inducible AS3 suppressor activity in a proliferating cell (i.e.,
proliferative shutoff activity), clonal cell lines were derived in which the presence or absence
of AS3 could be controlled using an inducible AS3 antisense gene. The prediction was that
cells incubated in the presence of androgen would have reduced levels of proliferation as
compared to the same cells in the absence of androgen or cells in the presence of androgen but
in which levels of AS3 had been experimentally ablated. Thus, cells incubated in the presence
of androgen, but also induced to express an antisense AS3 gene transcript that depletes cellular
levels of AS3, would be predicted to grow like wild type cells because, even though the
cells were being exposed to androgen, there would be insufficient AS3 to mediate the
androgen signal reducing cell growth. This is precisely what was observed (see, e.g., Table 1),
thereby demonstrating that AS3 is an androgen-induced suppressor of cell proliferation.

In order to develop the foregoing novel cell lines in which to demonstrate that AS3
mediates the androgen-induced shutoff effect, an inducible transgene encoding an AS3
antisense transcript (or empty vector as a negative control) was genetically engineered into a
retroviral vector backbone for efficient, stable integration into cells. In particular, the AS3
antisense gene was cloned into the Clontech pRevTRE retroviral vector under the control of
a tetracycline-sensitive promoter. The promoter has seven repeats of the bacterial tetO
operator sequence upstream of the minimal CMV promoter, and these repeats can be bound by the
tetracycline transactivator (tTA). The tTA is a fusion protein between the bacterial
tetracycline repressor and the VP16 herpes virus transactivator. The tetracycline
transactivator is sensitive to tetracycline such that, in the presence of tetracycline, the
transactivator cannot bind the tetracycline promoter so the transgene is "off" and, conversely,
in the absence of tetracycline, the gene is "on" (i.e., the Tet-Off system; see, e.g., Clontech
pRevTRE manual for further details).
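
The Tet-Off switching logic described above, and the prediction it is used to test, can be written out as a small truth table. The following Python sketch is illustrative only: the function names and the all-or-nothing treatment of AS3 (simply present or absent) are assumptions made for this sketch and are not part of the described method.

```python
from itertools import product


def transgene_on(tet_present: bool) -> bool:
    """Tet-Off: tTA binds the tetO repeats only when tetracycline is absent."""
    return not tet_present


def as3_available(androgen: bool, construct: str, tet_present: bool) -> bool:
    """Simplifying assumption: AS3 is present when androgen induces it and
    no AS3 antisense transcript is being expressed to deplete it."""
    antisense_expressed = construct == "antisense" and transgene_on(tet_present)
    return androgen and not antisense_expressed


def predicted_shutoff(androgen: bool, construct: str, tet_present: bool) -> bool:
    """Prediction under test: proliferative shutoff requires available AS3."""
    return as3_available(androgen, construct, tet_present)


if __name__ == "__main__":
    # Enumerate the eight construct / tetracycline / androgen combinations.
    for construct, tet, androgen in product(
        ("vector", "antisense"), (True, False), (False, True)
    ):
        label = f"{construct}, {'+' if tet else '-'}tet, {'+' if androgen else '-'}A"
        outcome = "shutoff" if predicted_shutoff(androgen, construct, tet) else "proliferation"
        print(f"{label:24s} -> predicted {outcome}")
```

Under these assumptions, the only androgen-treated condition predicted to keep proliferating is the antisense construct with tetracycline withdrawn.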

To stably introduce the foregoing inducible vectors into cells, the vectors were first
transfected into a packaging cell line (PT67) under drug selection (hygromycin). After 2-3
weeks of selection, the surviving packaging cells were used as a source of supernatants
containing infectious virions carrying the inducible transgene.

The supernatants containing the highest amounts of virions were then used to infect
the STFX1 cell line, a derivative of the MCF7-AR1 cell line that has a tetracycline transactivator
gene stably integrated. During the cloning and selection procedures, cells were maintained
in 1 µg/mL of tetracycline to suppress the expression of the inducible transgenes.

In order to characterize the cell lines expressing AS3 antisense and the "control" cell
lines having vector without insert, cells were incubated in the presence (i.e., the transgene is
off) and absence of tetracycline (i.e., the transgene is on) and, after 36 hours, assayed for
transcript expression using RT-PCR under standard conditions.

To determine the effect of androgen-induced AS3 suppression of cellular growth,
cell lines showing high levels of inducible AS3 antisense expression were chosen for
further study. Cells were seeded onto coverslips in standard growth medium and allowed to
grow for 5 days in the presence of 10 µg/ml tetracycline (hereafter "tet"). Then, the medium
was changed to i) 10 µg/ml tet, or ii) no tet for 36 hours as indicated in Table 1. Then,
vehicle or 10 nM of the androgen R1881 was added. Finally, 24 hours later, the cells were
treated with 10 µg/ml of bromodeoxyuridine (BrdU) for several hours to measure the
percentage of cells actively proliferating as a function of BrdU incorporation during DNA
replication. The cells were then fixed and Hoechst stained, and the percentage of BrdU-labeled
cells was determined using immunocytochemistry (using standard BrdU labeling
reagents and protocols from Boehringer).
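
The readout of this assay is a labeling index: BrdU-positive nuclei as a percentage of all Hoechst-stained nuclei in a field, summarized across fields as a mean and standard error. A minimal Python sketch of that arithmetic follows. The per-field counts are hypothetical values chosen for illustration (no raw counts are reported here), and the function names are likewise assumptions.

```python
import math
import statistics


def labeling_index(brdu_positive: int, total_nuclei: int) -> float:
    """Percent of Hoechst-stained nuclei in one field that are BrdU-positive."""
    if total_nuclei <= 0:
        raise ValueError("total_nuclei must be positive")
    return 100.0 * brdu_positive / total_nuclei


def mean_and_sem(values):
    """Mean and standard error of the mean (sample SD divided by sqrt(n))."""
    if len(values) < 2:
        raise ValueError("at least two fields are needed to estimate the SEM")
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean, sem


if __name__ == "__main__":
    # Hypothetical (BrdU-positive, total nuclei) counts for eight fields.
    fields = [(60, 170), (55, 160), (70, 200), (52, 150),
              (63, 180), (58, 165), (66, 190), (57, 155)]
    indices = [labeling_index(pos, total) for pos, total in fields]
    mean, sem = mean_and_sem(indices)
    print(f"{mean:.1f} +/- {sem:.1f} % BrdU-labeled nuclei (n = {len(indices)} fields)")
```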

As clearly indicated in Table 1, administration of the androgen R1881 results in a
significant decrease in the number of proliferating cells in S phase (compare to sample 1
where cells are not treated with androgen) when cellular levels of AS3 are unaffected
because the transgene vector is empty and uninduced (sample 2), empty and induced
(sample 4), or encoding AS3 antisense but uninduced (sample 6). In stark contrast, when AS3 levels
are reduced by antisense expression in the absence of tetracycline, the androgen R1881 no
longer induces a proliferative shutoff (compare sample 8 with sample 6).

Table 1. Analysis of the effect of tetracycline and androgen on the percent BrdU-labeled
nuclei of STFX1 cells containing vector only or AS3 antisense constructs

Sample   Treatment              % Proliferating Cells (Mean +/- SEM)
1        vector, +tet, -A       35.0 +/- 1.7
2        vector, +tet, +A        9.4 +/- 1.7
3        vector, -tet, -A       36.6 +/- 2.7
4        vector, -tet, +A       10.7 +/- 0.9
5        Antisense, +tet, -A    34.7 +/- 1.0
6        Antisense, +tet, +A     9.7 +/- 0.6
7        Antisense, -tet, -A    32.2 +/- 1.5
8        Antisense, -tet, +A    29.1 +/- 1.6



Vector denotes STFX1 cells expressing empty vector under tetracycline control. Tetracycline is
abbreviated tet; A denotes 10 nM R1881. Antisense denotes STFX1 cells expressing AS3-antisense
under tetracycline control. Tetracycline represses the expression of the vector and the AS3 antisense.
Each data point represents the mean of 8-9 low power fields containing about 150-200 cells/field.
Data were analyzed using the Mann-Whitney test; significance was measured at p < 0.001.
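
The size of the androgen effect in each condition can be read off Table 1 by comparing the +A mean with the matching -A mean. The Python sketch below does that arithmetic on the reported means only. It ignores the standard errors and is not a substitute for the Mann-Whitney analysis, and the names `TABLE_1`, `percent_reduction` and `REDUCTIONS` are introduced here purely for illustration.

```python
# Mean % BrdU-labeled nuclei transcribed from Table 1.
# Key: (construct, tetracycline state); value: (mean without androgen, mean with androgen).
TABLE_1 = {
    ("vector", "+tet"): (35.0, 9.4),       # samples 1 and 2
    ("vector", "-tet"): (36.6, 10.7),      # samples 3 and 4
    ("antisense", "+tet"): (34.7, 9.7),    # samples 5 and 6
    ("antisense", "-tet"): (32.2, 29.1),   # samples 7 and 8
}


def percent_reduction(without_androgen: float, with_androgen: float) -> float:
    """Relative drop in the proliferating fraction caused by androgen, in percent."""
    return 100.0 * (1.0 - with_androgen / without_androgen)


REDUCTIONS = {cond: percent_reduction(*means) for cond, means in TABLE_1.items()}

if __name__ == "__main__":
    for (construct, tet), reduction in REDUCTIONS.items():
        print(f"{construct:9s} {tet}: {reduction:5.1f}% fewer BrdU-labeled nuclei with R1881")
```

On the reported means, the three conditions in which AS3 is left intact each show a reduction of roughly 70 to 73 percent, whereas the induced antisense condition shows a reduction of about 10 percent.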

This experiment further shows that the expression of the empty vector neither 
interferes with the androgen-mediated shutoff, nor results in toxic effects. 

Accordingly, it is concluded that androgen-induced suppression of cell proliferation
(or proliferative shutoff) is modulated by AS3 because expression of an AS3 antisense that
reduces cellular levels of AS3 results in the blocking of androgen-induced AS3-mediated
suppression of cell growth.

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than
routine experimentation, many equivalents to the specific embodiments of the invention
described herein. Such equivalents are intended to be encompassed by the following claims.



What is claimed: 
Claims

1. An isolated nucleic acid molecule selected from the group consisting of:

(a) a nucleic acid molecule comprising the nucleotide sequence set forth in
SEQ ID NO:1; and

(b) a nucleic acid molecule comprising the nucleotide sequence set forth in
SEQ ID NO:3.

2. An isolated nucleic acid molecule which encodes a polypeptide comprising the
amino acid sequence set forth in SEQ ID NO:2.

3. An isolated nucleic acid molecule comprising the nucleotide sequence contained
in the plasmid deposited with ATCC® as Accession Number .

4. An isolated nucleic acid molecule which encodes a naturally occurring allelic
variant of a polypeptide comprising the amino acid sequence set forth in SEQ ID NO:2.

5. An isolated nucleic acid molecule selected from the group consisting of:

(a) a nucleic acid molecule comprising a nucleotide sequence which is at
least 50% homologous to the nucleotide sequence of SEQ ID NO:1 or 3, or a complement
thereof;

(b) a nucleic acid molecule comprising a fragment of at least 250 nucleotides
of a nucleic acid comprising the nucleotide sequence of SEQ ID NO:1 or 3, or a complement
thereof;

(c) a nucleic acid molecule which encodes a polypeptide comprising an
amino acid sequence at least about 45% homologous to the amino acid sequence of SEQ ID
NO:2; and

(d) a nucleic acid molecule which encodes a fragment of a polypeptide
comprising the amino acid sequence of SEQ ID NO:2, wherein the fragment comprises at
least 15 contiguous amino acid residues of the amino acid sequence of SEQ ID NO:2.

6. An isolated nucleic acid molecule which hybridizes to the nucleic acid molecule
of any one of claims 1, 2, 3, 4, or 5 under stringent conditions.

7. An isolated nucleic acid molecule comprising a nucleotide sequence which is
complementary to the nucleotide sequence of the nucleic acid molecule of any one of claims
1, 2, 3, 4, or 5.

8. An isolated nucleic acid molecule comprising the nucleic acid molecule of any
one of claims 1, 2, 3, 4, or 5, and a nucleotide sequence encoding a heterologous
polypeptide.

9. A vector comprising the nucleic acid molecule of any one of claims 1, 2, 3, 4, or 5.

10. The vector of claim 9, which is an expression vector.

11. A host cell transfected with the expression vector of claim 10.

12. A method of producing a polypeptide comprising culturing the host cell of claim
11 in an appropriate culture medium to, thereby, produce the polypeptide.

13. An isolated polypeptide selected from the group consisting of:

(a) a fragment of a polypeptide comprising the amino acid sequence of SEQ
ID NO:2, wherein the fragment comprises at least 15 contiguous amino acids of SEQ ID
NO:2;

(b) a naturally occurring allelic variant of a polypeptide comprising the amino
acid sequence of SEQ ID NO:2, wherein the polypeptide is encoded by a nucleic acid
molecule which hybridizes to a nucleic acid molecule consisting of SEQ ID NO:1 or 3 under
stringent conditions;

(c) a polypeptide which is encoded by a nucleic acid molecule comprising a
nucleotide sequence which is at least 50% homologous to a nucleic acid comprising the
nucleotide sequence of SEQ ID NO:1 or 3;

(d) a polypeptide comprising an amino acid sequence which is at least 45%
homologous to the amino acid sequence of SEQ ID NO:2.

14. The isolated polypeptide of claim 13 comprising the amino acid sequence of 
SEQ ID NO:2. 

15. The polypeptide of claim 13, further comprising heterologous amino acid
sequences.

16. An antibody which selectively binds to a polypeptide of claim 13.

17. A method for detecting the presence of a polypeptide of claim 13 in a sample
comprising:

(a) contacting the sample with a compound which selectively binds to the
polypeptide; and

(b) determining whether the compound binds to the polypeptide in the sample
to thereby detect the presence of a polypeptide of claim 13 in the sample.

18. The method of claim 17, wherein the compound which binds to the polypeptide
is an antibody.

19. A kit comprising a compound which selectively binds to a polypeptide of claim
13 and instructions for use.

20. A method for detecting the presence of a nucleic acid molecule of any one of
claims 1, 2, 3, 4, or 5 in a sample comprising:

(a) contacting the sample with a nucleic acid probe or primer which
selectively hybridizes to the nucleic acid molecule; and

(b) determining whether the nucleic acid probe or primer binds to a nucleic
acid molecule in the sample to thereby detect the presence of a nucleic acid molecule of any
one of claims 1, 2, 3, 4, or 5 in the sample.

21. The method of claim 20, wherein the sample comprises mRNA molecules and is
contacted with a nucleic acid probe.

22. A kit comprising a compound which selectively hybridizes to a nucleic acid
molecule of any one of claims 1, 2, 3, 4, or 5 and instructions for use.

23. A method for identifying a compound which binds to a polypeptide of claim 13 
comprising: 

(a) contacting the polypeptide, or a cell expressing the polypeptide with a test 
compound; and 

(b) determining whether the polypeptide binds to the test compound. 

24. A method for identifying a compound which modulates the activity of a 
polypeptide of claim 13 comprising: 

(a) contacting a polypeptide of claim 13 with a test compound; and 

(b) determining the effect of the test compound on the activity of the 
polypeptide to thereby identify a compound which modulates the activity of the polypeptide. 

25. A method for modulating the activity of a polypeptide of claim 13 comprising
contacting the polypeptide or a cell expressing the polypeptide with a compound in a 
sufficient concentration to modulate the activity of the polypeptide. 

26. A transgenic animal generated from a cell genetically engineered to lack nucleic 
acid encoding a AS3 polypeptide, said transgenic animal lacking expression of said AS3 
polypeptide. 

27. A transgenic animal generated from a cell that contains a substantially pure
nucleic acid replacing DNA encoding a AS3 polypeptide, wherein said nucleic acid is 
expressed in said transgenic animal. 

28. A method of identifying a compound that modulates cell proliferation, said 
method comprising: 

(a) providing a cell comprising a AS3 gene; and

(b) contacting said cell with a candidate compound; and 

(c) monitoring expression of said AS3 gene, wherein an alteration in the level of 
expression of said gene indicates a compound which modulates cell proliferation. 

29. A method of identifying a compound that modulates cell proliferation, said 
method comprising: 

(a) providing a cell comprising a reporter gene operably linked to a promoter from a 
AS3 gene; 

(b) contacting said cell with a candidate compound; and 

(c) measuring expression of said reporter gene, an alteration in said expression in 
response to said candidate compound identifying a compound that is able to modulate cell 
proliferation. 

30. The method of claim 28 or 29, wherein said alteration is an increase indicating 
said compound is an inhibitor of cell proliferation. 

31. A method of inhibiting the proliferation of a cell, said method comprising 
administering to said cell an amount of AS3 polypeptide or fragment thereof sufficient to 
inhibit cell proliferation. 

32. A method of inhibiting cell proliferation in a mammal, said method comprising 
providing a transgene encoding a AS3 polypeptide or fragment thereof to a cell of said 
mammal, said transgene being positioned for expression in said cell. 



33. A method of inhibiting cell proliferation in a cell, said method comprising
administering a compound which increases AS3 activity.

34. The method of claim 31, 32, or 33, wherein said AS3 is from a mammal. 

35. The method of claim 31, 32, or 33, wherein said cell is in a mammal. 

36. The method of claim 31, 32, or 33, wherein said cell is in a mammal diagnosed 
as having a condition involving cell proliferation. 

37. The method of claim 36, wherein said condition is cancer. 

38. The method of claim 37, wherein said cancer is prostate cancer. 



39. A method of diagnosing a mammal for the presence of disease involving altered
cell proliferation or an increased likelihood of developing a disease involving altered cell
proliferation, said method comprising isolating a sample of nucleic acid from said mammal
and determining whether said nucleic acid comprises a AS3 mutation, said mutation being
an indication that said mammal has a cell proliferative disease or an increased likelihood of
developing a disease involving cell proliferation.

40. A method of diagnosing a mammal for the presence of a disease involving
altered cell proliferation or an increased likelihood of developing a disease involving altered
cell proliferation, said method comprising measuring AS3 gene expression in a sample from
said mammal, an alteration in said expression relative to a sample from an unaffected
mammal being an indication that said mammal has a cell proliferative disease or increased
likelihood of developing a cell proliferative disease.



41. The method of claim 40, wherein said gene expression is measured by assaying
the amount of AS3 polypeptide or AS3 biological activity in said sample.

42. The method of claim 41 wherein said AS3 polypeptide is measured by
immunological methods or by assaying the amount of AS3 RNA in said sample.

43. The method of claim 40 or 41, wherein said mammal is a human. 

44. The method of claim 40, wherein said measuring is performed after or
concurrent with the administration of a hormone to said mammal.

45. The method of claim 44, wherein said hormone is an androgen.

46. A kit for diagnosing a mammal for the presence of a disease involving altered 
cell proliferation or an increased likelihood of developing a disease involving altered cell 
proliferation, said kit comprising a substantially pure antibody that specifically binds a AS3 
polypeptide. 

47. A kit for diagnosing a mammal for the presence of a disease involving altered 
cell proliferation or an increased likelihood of developing a disease involving altered cell 
proliferation, said kit comprising a material for measuring AS3 RNA. 

48. A method of obtaining a AS3 polypeptide, said method comprising:

(a) providing a cell with DNA encoding a AS3 polypeptide, said DNA being 
positioned for expression in said cell; 

(b) culturing said cell under conditions for expressing said DNA; and 

(c) isolating said AS3 polypeptide. 

49. A method of isolating a AS3 gene or portion thereof having sequence identity to
human AS3, said method comprising amplifying by polymerase chain reaction said AS3
gene or portion thereof using oligonucleotide primers wherein said primers

(a) are each greater than 15 nucleotides in length;

(b) each have regions of complementarity to opposite DNA strands in a region of the
nucleotide sequence of SEQ ID NO:1; and

(c) optionally contain sequences capable of producing restriction endonuclease cut
sites in the amplified product; and isolating said AS3 gene or portion thereof.

50. A method of detecting if a subject is at increased risk of developing prostate
cancer comprising directly or indirectly:

(a) detecting levels of AS3 nucleic acid or polypeptide; and

(b) observing if the subject has AS3 levels that are reduced as compared to a
standard wherein said reduced AS3 levels indicate said subject is at increased risk of
developing prostate cancer.

51. A kit for determining if a subject is at increased risk of developing prostate
cancer comprising:

(a) at least one reagent that specifically detects an AS3 molecule, wherein said
reagent is selected from the group consisting of antibodies that selectively bind AS3, and
oligonucleotide probes that selectively bind to DNA encoding AS3; and

(b) instructions for determining that the subject is at increased risk of developing
prostate cancer by

(c) detecting the presence or absence of AS3 in said subject with at least one reagent;
and

(d) observing whether or not the subject is at increased risk of developing prostate
cancer by observing if the presence of AS3 is or is not detected with said at least one
reagent, wherein reduced or absent levels of AS3 indicates said subject is at increased risk of
developing prostate cancer.

52. A method of prognosis for prostate cancer comprising:

(a) obtaining a biological sample from a subject;

(b) measuring AS3 nucleic acid or polypeptide levels in said sample;

(c) correlating said AS3 level with a baseline level, wherein the baseline level is
determined by measuring levels of AS3 in disease free subjects;

(d) correlating levels of AS3 at baseline or below, which is a negative result, with a
poorer prognosis than a positive result, wherein the level of AS3 is above the baseline level.

53. The method of claim 52, wherein said measuring is performed after or concurrent
with the administration of a hormone to said mammal.

54. The method of claim 52, wherein said hormone is an androgen.

55. The method of claim 52, wherein the level of AS3 is measured with an antibody
or antibody fragment thereof that selectively binds AS3.

56. The method of claim 52, wherein the level of AS3 is measured with a nucleic
acid probe or primer that selectively binds AS3.

57. A method of treating prostate cancer comprising,

(a) identifying a subject having or about to have prostate cancer;

(b) administering a hormone to said subject; and

(c) determining if said subject exhibits an increase in AS3 levels.

58. The method of claim 57, wherein the method further includes determining
that said subject exhibiting an increase in AS3 levels is likely to be responsive to the
inhibitory effect of androgens as compared to a subject not exhibiting an increase in AS3
levels.

59. The method of claim 57, wherein said subject showing an increase in AS3 levels 
is a candidate for hormone treatment. 

60. The method of claim 59, wherein said subject showing an increase in AS3 levels
is indicated as being a candidate for intermittent hormone treatment.

61. The method of claim 57, wherein the hormone is an androgen.

62. The method of claim 57, wherein the subject is a human.



63. The method of claim 57, wherein said AS3 levels are nucleic acid
levels or polypeptide levels.

64. The method of claim 57, wherein the level of AS3 is measured with an antibody
or antibody fragment thereof that selectively binds AS3.

65. The method of claim 57, wherein the level of AS3 is measured with a nucleic
acid probe or primer that selectively binds AS3.

66. A method of diagnosing a mammal for the presence of disease associated with
AS3 or an AS3 related molecule or an increased likelihood of developing a disease
associated with an AS3 or an AS3 related molecule, said method comprising isolating a
sample of nucleic acid from said mammal and determining whether said nucleic acid
comprises a mutation in an AS3 or AS3 related molecule, said mutation being an indication
that said mammal has a disease associated with AS3 or an AS3 related molecule or an
increased likelihood of developing a disease associated with an AS3 or an AS3 related
molecule.

67. A method of diagnosing a mammal for the presence of a disease associated with
AS3 or an AS3 related molecule or an increased likelihood of developing a disease
associated with an AS3 or an AS3 related molecule, said method comprising measuring
expression levels of AS3 or an AS3 related molecule in a sample from said mammal, an
alteration in said expression relative to a sample from an unaffected mammal being an
indication that said mammal has a disease or increased likelihood of a disease associated
with AS3 or an AS3 related molecule.



A NOVEL ANDROGEN-INDUCED SUPPRESSOR OF
CELL PROLIFERATION AND USES THEREOF

Abstract of the Disclosure

The invention provides novel AS3 nucleic acid sequences, AS3 polypeptides, anti-AS3
antibodies, and methods for modulating cell proliferation and detecting compounds that
modulate cell proliferation. Diagnostic and therapeutic methods utilizing compositions of
the invention are also provided.



[Figure: nucleotide sequence (positions 1-5253) with predicted amino acid translation]
Lylnij^lhLasdffLkhpgkdVrllvaccLadifrIyapeapYtspdkLkdifmpItrqlkgL 

96 217 241 277 

LdtvlvnLvpahknLnkqaydL LmlgktsIsdlsehVfdlileLynidshLllsvlpqL 

19 355 375 404 

LgrfndihVpirlecVkfashcLmnhpdLakdlteyL VtaakkdIllvndhLLnfvrertLdkrwrV 
C GGAGAGGAGGA.GGAA.CGGCAGGGCT GGCT GCGGAAGGGGA.GGGGGGGGGAGAA.GGCGAT T GGAT GCGGC GGCGGCGGCGGAT CCCGGAGAGC CCCGGAG 
TC*GCGGAGTAG^^ 

GGGGT CAAGGAAAT AT CAGAT AAAAT AT CT AAAGAGGAGAT GGTGAGACGAT TAAAGAT GGT T GT GAAAACTTT TAT ^9^^^^^S^^^^g ^ ^^f^Q 
GlyValLysGluIleSerAspLysIleSerLysGluGluMetValArgArgl^uLysMetValValLysThrPheMetAspMetAspGlr^spSerGluG 

AAGAAAAGGAGCTTT AT T T AAACCTAGCTTT ACAT CT T GCTTC^GA^ TTTTTTCTCAAGCATCCTGGT AAAGATGTT ^^^^^^^9^^ p^?^ 
luGluLysGluLeuTyrLeuAsnLeuAlaLeuHisLeuAlaSerAspPhePheLeuLysHisProGlyLysAspValArgLeuLeuValAlaCysCysLe 

T GCT GAT ATTTTCAGGATTT ATG CT CCTGAAGCTCCTT ACACATCCCCTGAT AAACT AAAGGAT AT ATTTATGTTTATAACAA 

uAlaAspIlePheArglleTyrAlaProGluAlaProTyrThrSerProAspLysLeuLysAspIlePheMetPhelleThrArgGlnLeuLysGlyLeu 
GAGGAT ACAAAGAGCCCACAATTCAATAGGT AT T TTT ATTTACT TGAGAACATTGCTTGGGTCAAGTCAT AT AACAT AT 

GluAspThrLysSerProGlnPheAsnArgTyrPheTyrLeuLeuGluAsnlleAlaTrpValLysSerTyrAsnlleCysPheGLuLeuGluAspSerA 

60 1 AT GAAATTTTCACCCAGCTATACAGAACCT TATTTTCAGTT ATAAACAATGGCCACAATCAGAAAGTCG TATGAGCTCT AT 

snGluIlePheThrGlnLeuTyrArgThrLeuPheSerVallleAsnAsnGlyHisAsnGlnLysValHisMetHisMetValAspLeuMetSerSerll 

701 TAT TTGTGAAGGTGAT ACAGT GTCTCAGGAGCT T T T GGATACGGTTTT AGT AAATCT GGTACCTGCTCATAAGAATTT AAACAAGCAAGCAT ATGATTTG 
701 y gg]^Q^ y^^p^^va l^e r G^n Gl uLeu Le uAs pTh r Va 1 Le u Va 1 As n Le u Va 1 P r o Al a H i s L y s As n Le uAsn Ly s G 1 nA 1 a T y r As pLe u 

801 gcaaaggctttactgaagagc^cagctcaAgctatt^ 

AlaLysAlaLeuLeuLysArgThrAlaGlnAlalleGluProTyrlleThrThrPhePheAsnGlnValLeuMetLeuGlyLysThrSerlleSerAspL 
1001 TGATAATGAGGAGCGCCTACAAGTTGTTAAACTACTGGCAAAAATGTTTGGGGCAAAGGA 

1101 T AC TTGGGCAGGTTT AATGATAT CCATGT ACCAATCCGCCTGGAATGTGT GAAATTT GCT AGCCAT TGTCTCATGAACCATCCT GATT TAGCAAAAGACT 
1101 ™CTTGGG^ 

TAACAGAGT AT CT T AAAGTGAGGTCACATGACCCT GAGGAAGCTATTAGACATGATGTT ATTGTGT CAAT AGTT ACAGCTGCTAAAAAGGAT AT T CTT CT 
GGTCAAT GATCACTTACT T AATTTTGTGAGAGAGAGAACATT AGACAAACGAT GGAGAGT ACGCAAAGAAGCCATGATGGGACTTGCCCAAATTT ATAAG 

AAATATGCTTTACAGTCAGCAGCTGGAAAA^ 
1 S 0 1 ATCGACTACTTGTTGAACGGATCTTTGCTCAATACATGGTTCCTCACAATTTAGAAACTACAGAACGGATG 
GGATTTAAATGCTGTGAAAGCATTGAATGAAATGTGGAAATGTCAAAATCTGCTCCGACAT 

ACAGAT GCCAGTGTCAAGGCC AT AT TTTCAAAAGTGATGGTT ATTACH 
SpAlal^ 

AC GTGT T AGAAGAT GAT GAGAAAAT AAGAAAG CAGT T AGAAGT AC TT GT T AGTC CAACAT GCT C CT GCAAGCAG GCT GAAGGTT GT GT GCGT GAAAT AAC 
TAAGAAGTTGGGCAACCCCAAACAGCCTACAAATCCTTTCCTGGAAATGATCA 
ATCAGTGCTCTTATTAAACAAGTGAACAAATCAATAGATGGAACAGC^ 

AArTrrTTAAGGTACTCTCATTTACACATCCCATCTCATTTCATTCTGCTGAAACATTTGAATCATTACTGGCTTGTCTGAAAA 

SEISED 

rTtar araarTrTTTrrcCAC^ AACTTCACAA^GGCCTTTCCCGTTTACGGCTTCCACTTGAGTATATGGCAATCTGTGCCCTTTGTGCAAA 

r ^ AC AC AAG^CTCATGCTAGGCAATGTTTGGTGAAAAAT AT AA^ 

^LysG^S^^ 

f "TCTCTTCTACCAGAGTATGT™GTTCCATATACAATTCACCTTTTGGCACATGACCCAGATTATGTCAAAGTACAGGATATTG 

AAAGAATGTCTTTGGTTTGTTCTGGAAATATTAYTGGCTAAAAATGAAAATAACAGTCAC^ 
^G^CylSuTrpPheV^ 
3301 



3401 



3501 




SerPhePheThrProGlyLysProLysThrThrAsnValLeuGlyAlaValAsnLysProLeuSerSerAlaGlyLysGlnSerGlnThr-LysSerSerA 1151 

3601 GAATG GAAACT GT AAGCAATGCAAGCAGCAGC T CAAAT CC AAGC T C T C C T GGAAGAAT AAAGGGGAGGC T T GAT AGT T C T GAAAT GGAT CACAGT GAAAA 

rgMetGluThrValSerAsnAlaSerSerSerSerAsnProSerSerProGlyArglleLysGlyArgLeuAspSerSerGluMetAspHisSerGluAs 1184 

37 01 TGAAGATTACACAATGTCTTCACCTTTGCCGGGGAAAAAAAGTGACAAGAGAGACGACTCTGATCTTGTAAGGTCTGAAT 

nGluAspTyrThrMetSerSerProLeuProGlyLysLysSerAspLysArgAspAspSerAspLeuValArgSerGluLeuGluLysProArgGlyArg 1217 

3801 AAAAAAACGCCCGTCACAGAACAGGAGGAGAAATTAGGT 

LysLysThrProValThrGluGlnGluGluLysLeuGlyMetAspAspLeuThrLysLeuValGlnGluGlnLysProLysGlySerGlaArgSerArgL 1251 

3901 AAAGAGGCCATACGGCTTCAGAATCTGATGAACAGCAGTGGCCTGAGGAA^ 

ysArgGlyHisThrAlaSerGluSerAspGluGlnGlnTrpProGluGluLysArgLeuLysGluAspIleLeuGluAsnGluAspGluGlaAsnSerPr 128 4 

4 0 01 GCCAAAAAAGGGTAAAAGAGGCCGACCACCAAAACCTCTTGGT 

oProLysLysGlyLysArgGlyArgProProLysProLeuGlyGlyGlyThrProLysGluGluProThrMetLysThrSerLysLysGlySerLysLys 1317 

4101 AAATCTGGACCTCCAGCACCAGAGGAGGAGGAAGAAGAAGAAAGACAAAGTGGAAATACGGAACAGAAGTCCAAAAGCAA 

LysSerGlyProProAlaProGluGluGluGluGluGluGluArgGlnSerGlyAsnThrGluGlriLysSerLysSerLysGlnHisArgValSerArgA 13 51 

4 201 GAGCACAGCAGAGAGCAGAATCTCCTGAATCTAGTGCAATTGAATCCACACAGTCCACACCACAGAAAGGACGAGGAAGACCATCAAAAACGCCATCACC 

rgAlaGlnGlnArgAlaGluSerProGluSerSerAlalleGluSerThrGlnSerThrProGlnLysGlyArgGlyArgProSerLysThrProSerPr 138 4 

4 301 ATCACAACCAAAAAAAAATGTGTAAGTTGTAAATATT^ 

oSerGlnProLysLysAsnValEnd 1391 

4 401 TGTATTTAAATTCCATATATTTAGCCCCATTACACTAGGTACGGCGGCGAAGTGCTAAAAGGGAAC^ 

4 501 TGAAAGCTTTGGAAAAATCTTTTTTTTTTTTTTTTTTTTTTGGTCAAGCTTGAGGCTGAATAAAGCCTTTGATGCACAAAATGGGACTGCTGAAGAGTGG 

4 601 ACAGTTGGACCTTACTTTGGTGACCCCATACATTTGTGGTCACATGCTTTAGCCATACACATGGTAACATTGACTATGGAGTCTTGTGAAAGTGTAATGT 

4 7 01 GCGATGGCTATGTAGACATAAAGAAGAAACTTGTAAATATCTTTTTTCTTTTTTTTAATGTTTCTGATTTCTGAAGTGC 

- a 4801 CTTTAAACTGACAGTACCCGACTGTTTATTGGATCTATTGATTTGAAAAGAATTTGTTAGGATAGATCTTAAGCAGTAATCTGTCAGT 

" 4 901 ATTTTCTGC AATTTTA CTGTGAAAAAAAATTTGTTTTCAACAATTGGTGTCATTTTCTTGATGTCACTATTTGTTGGAGAGTTAAATGGTCTCTTCCCTT 

il 5001 TGTGTATCTTACCTAGTGTTTACTCCTGGGCACCCTTAATCTTCAGAGGTGCTAAATTGTCTGCCATTACACCAGAAGGATGCCTCTGATAGGAGGACAA 

- 5101 CCATGCAAATTGTGAAATAGTCCTGAAGTTCTTGGATTACTTTACACCTCAGTATTGATTTGTCCCAGAATTTTCTGGCCTTTCATGGCAATGAAAATTT 

J 5201 TAAGAAGAAAG ATTTA AAGT ATTTTAATTTTA AAGA.GTGTGTTATAA^ 

5301 TCTACTGTATCAATAAAATTCTGTAATTTGAATGAGTAAAAAAAAAAAAAAAAAA (5355)

Val Arg Arg Leu Lys Met Val Val Lys Thr Phe Met Asp Met Asp Gln
35 40 45

gac tct gaa gaa gaa aag gag ctt tat tta aac cta gct tta cat ctt 254
Asp Ser Glu Glu Glu Lys Glu Leu Tyr Leu Asn Leu Ala Leu His Leu
50 55 60

gct tca gat ttt ttt ctc aag cat cct ggt aaa gat gtt cgc tta ctg 302
Ala Ser Asp Phe Phe Leu Lys His Pro Gly Lys Asp Val Arg Leu Leu
65 70 75

gta gcc tgc tgc ctt gct gat att ttc agg att tat gct cct gaa gct 350
Val Ala Cys Cys Leu Ala Asp Ile Phe Arg Ile Tyr Ala Pro Glu Ala
80 85 90 95

cct tac aca tcc cct gat aaa cta aag gat ata ttt atg ttt ata aca 398
Pro Tyr Thr Ser Pro Asp Lys Leu Lys Asp Ile Phe Met Phe Ile Thr
100 105 110

aga cag ttg aag ggg cta gag gat aca aag agc cca caa ttc aat agg 446
Arg Gln Leu Lys Gly Leu Glu Asp Thr Lys Ser Pro Gln Phe Asn Arg
115 120 125

tat ttt tat tta ctt gag aac att gct tgg gtc aag tca tat aac ata 494
Tyr Phe Tyr Leu Leu Glu Asn Ile Ala Trp Val Lys Ser Tyr Asn Ile
130 135 140

tgc ttt gag tta gaa gat agc aat gaa att ttc acc cag cta tac aga 542
Cys Phe Glu Leu Glu Asp Ser Asn Glu Ile Phe Thr Gln Leu Tyr Arg
145 150 155

acc tta ttt tca gtt ata aac aat ggc cac aat cag aaa gtc cat atg 590
Thr Leu Phe Ser Val Ile Asn Asn Gly His Asn Gln Lys Val His Met
160 165 170 175

cac atg gta gac ctt atg agc tct att att tgt gaa ggt gat aca gtg 638
His Met Val Asp Leu Met Ser Ser Ile Ile Cys Glu Gly Asp Thr Val
180 185 190

tct cag gag ctt ttg gat acg gtt tta gta aat ctg gta cct gct cat 686
Ser Gln Glu Leu Leu Asp Thr Val Leu Val Asn Leu Val Pro Ala His
195 200 205

aag aat tta aac aag caa gca tat gat ttg gca aag gct tta ctg aag 734
Lys Asn Leu Asn Lys Gln Ala Tyr Asp Leu Ala Lys Ala Leu Leu Lys
210 215 220

agg aca gct caa gct att gag cca tat att acc act ttt ttt aat cag 782
Arg Thr Ala Gln Ala Ile Glu Pro Tyr Ile Thr Thr Phe Phe Asn Gln
225 230 235

gtt ctg atg ctt ggg aaa aca tct atc agc gat ttg tca gag cat gtc 830
Val Leu Met Leu Gly Lys Thr Ser Ile Ser Asp Leu Ser Glu His Val
240 245 250 255

ttt gac tta att ttg gag ctc tac aat att gat agt cat ttg ctg ctc 878
Phe Asp Leu Ile Leu Glu Leu Tyr Asn Ile Asp Ser His Leu Leu Leu
260 265 270

tct gtt tta ccc cag ctt gaa ttt aaa tta aag agc aat gat aat gag 926
Ser Val Leu Pro Gln Leu Glu Phe Lys Leu Lys Ser Asn Asp Asn Glu
275 280 285

gag cgc cta caa gtt gtt aaa cta ctg gca aaa atg ttt ggg gca aag 974
Glu Arg Leu Gln Val Val Lys Leu Leu Ala Lys Met Phe Gly Ala Lys
290 295 300

gat tca gaa ttg gct tct caa aac aag cca ctt tgg cag tgc tac ttg 1022
Asp Ser Glu Leu Ala Ser Gln Asn Lys Pro Leu Trp Gln Cys Tyr Leu
305 310 315

ggc agg ttt aat gat atc cat gta cca atc cgc ctg gaa tgt gtg aaa 1070
Gly Arg Phe Asn Asp Ile His Val Pro Ile Arg Leu Glu Cys Val Lys
320 325 330 335

ttt gct agc cat tgt ctc atg aac cat cct gat tta gca aaa gac tta 1118
Phe Ala Ser His Cys Leu Met Asn His Pro Asp Leu Ala Lys Asp Leu
340 345 350

aca gag tat ctt aaa gtg agg tca cat gac cct gag gaa gct att aga 1166
Thr Glu Tyr Leu Lys Val Arg Ser His Asp Pro Glu Glu Ala Ile Arg
355 360 365

cat gat gtt att gtg tca ata gtt aca gct gct aaa aag gat att ctt 1214
His Asp Val Ile Val Ser Ile Val Thr Ala Ala Lys Lys Asp Ile Leu
370 375 380

ctg gtc aat gat cac tta ctt aat ttt gtg aga gag aga aca tta gac 1262
Leu Val Asn Asp His Leu Leu Asn Phe Val Arg Glu Arg Thr Leu Asp
385 390 395

aaa cga tgg aga gta cgc aaa gaa gcc atg atg gga ctt gcc caa att 1310
Lys Arg Trp Arg Val Arg Lys Glu Ala Met Met Gly Leu Ala Gln Ile
400 405 410 415

tat aag aaa tat gct tta cag tca gca gct gga aaa gat gct gca aaa 1358
Tyr Lys Lys Tyr Ala Leu Gln Ser Ala Ala Gly Lys Asp Ala Ala Lys
420 425 430

cag ata gca tgg atc aaa gac aaa ttg cta cat ata tat tat caa aat 1406
Gln Ile Ala Trp Ile Lys Asp Lys Leu Leu His Ile Tyr Tyr Gln Asn
435 440 445

agt att gat gat cga cta ctt gtt gaa cgg atc ttt gct caa tac atg 1454
Ser Ile Asp Asp Arg Leu Leu Val Glu Arg Ile Phe Ala Gln Tyr Met
450 455 460

gtt cct cac aat tta gaa act aca gaa cgg atg aaa tgc tta tat tac 1502
Val Pro His Asn Leu Glu Thr Thr Glu Arg Met Lys Cys Leu Tyr Tyr
465 470 475

ttg tat gcc aca ctg gat tta aat gct gtg aaa gca ttg aat gaa atg 1550
Leu Tyr Ala Thr Leu Asp Leu Asn Ala Val Lys Ala Leu Asn Glu Met
480 485 490 495

tgg aaa tgt caa aat ctg ctc cga cat caa gta aag gat ttg ctt gac 1598
Trp Lys Cys Gln Asn Leu Leu Arg His Gln Val Lys Asp Leu Leu Asp
500 505 510

ttg att aag caa ccc aaa aca gat gcc agt gtc aag gcc ata ttt tca 1646
Leu Ile Lys Gln Pro Lys Thr Asp Ala Ser Val Lys Ala Ile Phe Ser
515 520 525

aaa gtg atg gtt att aca aga aat tta cct gat cct ggt aag gct cag 1694
Lys Val Met Val Ile Thr Arg Asn Leu Pro Asp Pro Gly Lys Ala Gln
530 535 540

gat ttc atg aag aaa ttc aca cag gtg tta gaa gat gat gag aaa ata 1742
Asp Phe Met Lys Lys Phe Thr Gln Val Leu Glu Asp Asp Glu Lys Ile
545 550 555

aga aag cag tta gaa gta ctt gtt agt cca aca tgc tcc tgc aag cag 1790
Arg Lys Gln Leu Glu Val Leu Val Ser Pro Thr Cys Ser Cys Lys Gln
560 565 570 575

gct gaa ggt tgt gtg cgt gaa ata act aag aag ttg ggc aac ccc aaa 1838
Ala Glu Gly Cys Val Arg Glu Ile Thr Lys Lys Leu Gly Asn Pro Lys
580 585 590

cag cct aca aat cct ttc ctg gaa atg atc aag ttt ctc ttg gag agg 1886
Gln Pro Thr Asn Pro Phe Leu Glu Met Ile Lys Phe Leu Leu Glu Arg
595 600 605

ata gca cct gtg cac ata gat acc gaa tct atc agt gct ctt att aaa 1934
Ile Ala Pro Val His Ile Asp Thr Glu Ser Ile Ser Ala Leu Ile Lys
610 615 620

caa gtg aac aaa tca ata gat gga aca gca gat gat gaa gat gag ggt 1982
Gln Val Asn Lys Ser Ile Asp Gly Thr Ala Asp Asp Glu Asp Glu Gly
625 630 635

gtt cca act gat caa gcc atc aga gca ggt ctt gaa ctg ctt aag gta 2030
Val Pro Thr Asp Gln Ala Ile Arg Ala Gly Leu Glu Leu Leu Lys Val
640 645 650 655

ctc tca ttt aca cat ccc atc tca ttt cat tct gct gaa aca ttt gaa 2078
Leu Ser Phe Thr His Pro Ile Ser Phe His Ser Ala Glu Thr Phe Glu
660 665 670

tca tta ctg gct tgt ctg aaa atg gat gat gaa aaa gta gca gaa gct 2126
Ser Leu Leu Ala Cys Leu Lys Met Asp Asp Glu Lys Val Ala Glu Ala
675 680 685

gca cta caa att ttc aaa aac aca gga agc aaa att gaa gag gat ttt 2174
Ala Leu Gln Ile Phe Lys Asn Thr Gly Ser Lys Ile Glu Glu Asp Phe
690 695 700

cca cac atc aga tca gcc ttg ctt cct gtt tta cat cac aaa tct aaa 2222
Pro His Ile Arg Ser Ala Leu Leu Pro Val Leu His His Lys Ser Lys
705 710 715

aaa gga ccc ccc cgt caa gcc aaa tat gcc att cat tgt atc cat gcg 2270
Lys Gly Pro Pro Arg Gln Ala Lys Tyr Ala Ile His Cys Ile His Ala
720 725 730 735

ata ttt tct agt aaa gag acc cag ttt gca cag ata ttt gag cct ctg 2318
Ile Phe Ser Ser Lys Glu Thr Gln Phe Ala Gln Ile Phe Glu Pro Leu
740 745 750

cat aag agc cta gat cca agc aac ctg gaa cat ctc ata aca cca ttg 2366
His Lys Ser Leu Asp Pro Ser Asn Leu Glu His Leu Ile Thr Pro Leu
755 760 765

gtt act att ggt cat att gct ctc ctt gca cct gat caa ttt gct gct 2414
Val Thr Ile Gly His Ile Ala Leu Leu Ala Pro Asp Gln Phe Ala Ala
770 775 780

cct tgg aaa tct tgg gta gct act ttc att gtg aaa gat ctt ctc atg 2462
Pro Trp Lys Ser Trp Val Ala Thr Phe Ile Val Lys Asp Leu Leu Met
785 790 795

aat gat cgg ctt cca ggg aaa aag aca act aaa ctt tgg gtt cca gat 2510
Asn Asp Arg Leu Pro Gly Lys Lys Thr Thr Lys Leu Trp Val Pro Asp
800 805 810 815

gaa gaa gta tct cct gag aca atg gtc aaa att cag gct att aaa atg 2558
Glu Glu Val Ser Pro Glu Thr Met Val Lys Ile Gln Ala Ile Lys Met
820 825 830

atg gtt cga tgg cta ctt gga atg aaa aat aat cac agt aaa tca gga 2606
Met Val Arg Trp Leu Leu Gly Met Lys Asn Asn His Ser Lys Ser Gly
835 840 845

act tct acc tta aga ttg cta aca aca ata ttg cat agt gat gga gac 2654
Thr Ser Thr Leu Arg Leu Leu Thr Thr Ile Leu His Ser Asp Gly Asp
850 855 860

ttg aca gaa cag ggg aaa att agt aaa cca gat atg tca cgt ctg aga 2702
Leu Thr Glu Gln Gly Lys Ile Ser Lys Pro Asp Met Ser Arg Leu Arg
865 870 875

ctt gct gct ggg agt gct att gtg aag ctg gca caa gaa ccc tgt tac 2750
Leu Ala Ala Gly Ser Ala Ile Val Lys Leu Ala Gln Glu Pro Cys Tyr
880 885 890 895

cat gaa atc atc aca tta gaa caa tat cag cta tgt gca tta gct atc 2798
His Glu Ile Ile Thr Leu Glu Gln Tyr Gln Leu Cys Ala Leu Ala Ile
900 905 910

aac gat gaa tgc tat caa gta aga caa gtg ttt gcc cag aaa ctt cac 2846
Asn Asp Glu Cys Tyr Gln Val Arg Gln Val Phe Ala Gln Lys Leu His
915 920 925

aaa ggc ctt tcc cgt tta cgg ctt cca ctt gag tat atg gca atc tgt 2894
Lys Gly Leu Ser Arg Leu Arg Leu Pro Leu Glu Tyr Met Ala Ile Cys
930 935 940

gcc ctt tgt gca aaa gat cct gta aag gag aga aga gct cat gct agg 2942
Ala Leu Cys Ala Lys Asp Pro Val Lys Glu Arg Arg Ala His Ala Arg
945 950 955

caa tgt ttg gtg aaa aat ata aat gta agg cgg gag tat ctg aag cag 2990
Gln Cys Leu Val Lys Asn Ile Asn Val Arg Arg Glu Tyr Leu Lys Gln
960 965 970 975

cat gca gct gtt agt gaa aaa tta ttg tct ctt cta cca gag tat gtt 3038
His Ala Ala Val Ser Glu Lys Leu Leu Ser Leu Leu Pro Glu Tyr Val
980 985 990

gtt cca tat aca att cac ctt ttg gca cat gac cca gat tat gtc aaa 3086
Val Pro Tyr Thr Ile His Leu Leu Ala His Asp Pro Asp Tyr Val Lys
995 1000 1005

gta cag gat att gaa caa ctt aaa gat gtt aaa gaa tgt ctt tgg ttt 3134
Val Gln Asp Ile Glu Gln Leu Lys Asp Val Lys Glu Cys Leu Trp Phe
1010 1015 1020

gtt ctg gaa ata tta atg gct aaa aat gaa aat aac agt cac gct ttt 3182
Val Leu Glu Ile Leu Met Ala Lys Asn Glu Asn Asn Ser His Ala Phe
1025 1030 1035

atc aga aag atg gta gaa aat att aaa caa aca aaa gat gcc caa gga 3230
Ile Arg Lys Met Val Glu Asn Ile Lys Gln Thr Lys Asp Ala Gln Gly
1040 1045 1050 1055

cca gat gat gca aaa atg aat gaa aaa ctg tac act gtg tgt gat gtt 3278
Pro Asp Asp Ala Lys Met Asn Glu Lys Leu Tyr Thr Val Cys Asp Val
1060 1065 1070

gcc atg aat atc atc atg tca aag agt act aca tac agt ttg gaa tct 3326
Ala Met Asn Ile Ile Met Ser Lys Ser Thr Thr Tyr Ser Leu Glu Ser
1075 1080 1085

cct aaa gac ccg gta cta cca gct cgt ttc ttc act caa cct gac aag 3374
Pro Lys Asp Pro Val Leu Pro Ala Arg Phe Phe Thr Gln Pro Asp Lys
1090 1095 1100

aat ttc agt aac acc aaa aat tat ctg cct cct gaa atg aaa tca ttt 3422
Asn Phe Ser Asn Thr Lys Asn Tyr Leu Pro Pro Glu Met Lys Ser Phe
1105 1110 1115

ttc act cct gga aaa cct aaa aca acc aat gtt cta gga gct gtt aac 3470
Phe Thr Pro Gly Lys Pro Lys Thr Thr Asn Val Leu Gly Ala Val Asn
1120 1125 1130 1135

aag cca ctt tca tca gca ggc aag caa tct cag acc aaa tca tca cga 3518
Lys Pro Leu Ser Ser Ala Gly Lys Gln Ser Gln Thr Lys Ser Ser Arg
1140 1145 1150

atg gaa act gta agc aat gca agc agc agc tca aat cca agc tct cct 3566
Met Glu Thr Val Ser Asn Ala Ser Ser Ser Ser Asn Pro Ser Ser Pro
1155 1160 1165

gga aga ata aag ggg agg ctt gat agt tct gaa atg gat cac agt gaa 3614
Gly Arg Ile Lys Gly Arg Leu Asp Ser Ser Glu Met Asp His Ser Glu
1170 1175 1180

aat gaa gat tac aca atg tct tca cct ttg ccg ggg aaa aaa agt gac 3662
Asn Glu Asp Tyr Thr Met Ser Ser Pro Leu Pro Gly Lys Lys Ser Asp
1185 1190 1195

aag aga gac gac tct gat ctt gta agg tct gaa ttg gag aag cct aga 3710
Lys Arg Asp Asp Ser Asp Leu Val Arg Ser Glu Leu Glu Lys Pro Arg
1200 1205 1210 1215

ggc agg aaa aaa acg ccc gtc aca gaa cag gag gag aaa tta ggt atg 3758
Gly Arg Lys Lys Thr Pro Val Thr Glu Gln Glu Glu Lys Leu Gly Met
1220 1225 1230

gat gac ttg act aag ttg gta cag gaa cag aaa cct aaa ggc agt cag 3806
Asp Asp Leu Thr Lys Leu Val Gln Glu Gln Lys Pro Lys Gly Ser Gln
1235 1240 1245

cga agt cgg aaa aga ggc cat acg gct tca gaa tct gat gaa cag cag 3854
Arg Ser Arg Lys Arg Gly His Thr Ala Ser Glu Ser Asp Glu Gln Gln
1250 1255 1260

tgg cct gag gaa aag agg ctc aaa gaa gat ata tta gaa aat gaa gat 3902
Trp Pro Glu Glu Lys Arg Leu Lys Glu Asp Ile Leu Glu Asn Glu Asp
1265 1270 1275

gaa cag aat agt ccg cca aaa aag ggt aaa aga ggc cga cca cca aaa 3950
Glu Gln Asn Ser Pro Pro Lys Lys Gly Lys Arg Gly Arg Pro Pro Lys
1280 1285 1290 1295

cct ctt ggt gga ggt aca cca aaa gaa gag cca aca atg aaa act tct 3998
Pro Leu Gly Gly Gly Thr Pro Lys Glu Glu Pro Thr Met Lys Thr Ser
1300 1305 1310

aaa aaa gga agc aaa aaa aaa tct gga cct cca gca cca gag gag gag 4046
Lys Lys Gly Ser Lys Lys Lys Ser Gly Pro Pro Ala Pro Glu Glu Glu
1315 1320 1325

gaa gaa gaa gaa aga caa agt gga aat acg gaa cag aag tcc aaa agc 4094
Glu Glu Glu Glu Arg Gln Ser Gly Asn Thr Glu Gln Lys Ser Lys Ser
1330 1335 1340

aaa cag cac cga gtg tca agg aga gca cag cag aga gca gaa tct cct 4142
Lys Gln His Arg Val Ser Arg Arg Ala Gln Gln Arg Ala Glu Ser Pro
1345 1350 1355

gaa tct agt gca att gaa tcc aca cag tcc aca cca cag aaa gga cga 4190
Glu Ser Ser Ala Ile Glu Ser Thr Gln Ser Thr Pro Gln Lys Gly Arg
1360 1365 1370 1375

gga aga cca tca aaa acg cca tca cca tca caa cca aaa aaa aat gtg 4238
Gly Arg Pro Ser Lys Thr Pro Ser Pro Ser Gln Pro Lys Lys Asn Val
1380 1385 1390

taagttgtaa atattacatt tcaaaccaat ttcaaattat tttgcaaaag ttcctaaatt 4298
tgtaaacata catattgctg tatttaaatt ccatatattt agccccatta cactaggtac 4358
ggcggcgaag tgctaaaagg gaacggcgat gaacaaatgt aattaataac tttctctgtg 4418
aaagctttgg aaaaatcttt tttttttttt tttttttttg gtcaagcttg aggctgaata 4478
aagcctttga tgcacaaaat gggactgctg aagagtggac agttggacct tactttggtg 4538
accccataca tttgtggtca catgctttag ccatacacat ggtaacattg actatggagt 4598
cttgtgaaag tgtaatgtgc gatggctatg tagacataaa gaagaaactt gtaaatatct 4658
tttttctttt ttttaatgtt tctgatttct gaagtgcttg tatagctttt atctgcggct 4718
ttaaactgac agtacccgac tgtttattgg atctattgat ttgaaaagaa tttgttagga 4778
tagatcttaa gcagtaatct gtcagtgttt gtatttgtat tttctgcaat tttactgtga 4838
aaaaaaattt gttttcaaca attggtgtca ttttcttgat gtcactattt gttggagagt 4898
taaatggtct cttccctttg tgtatcttac ctagtgttta ctcctgggca cccttaatct 4958
tcagaggtgc taaattgtct gccattacac cagaaggatg cctctgatag gaggacaacc 5018
atgcaaattg tgaaatagtc ctgaagttct tggattactt tacacctcag tattgatttg 5078
tcccagaatt ttctggcctt tcatggcaat gaaaatttta agaagaaaga tttaaagtat 5138
tttaatttta aagagtgtgt tataaaataa tgtactgaat tctttatccc attttatcat 5198
cctttcagtt tttattaatc tactgtatca ataaaattct gtaatttgaa tgagtaaaaa 5258
aaaaaaaaaa aaa 5271



<210> 2 
<211> 1391 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Ala His Ser Lys Thr Arg Thr Asn Asp Gly Lys Ile Thr Tyr Pro
1 5 10 15

Pro Gly Val Lys Glu Ile Ser Asp Lys Ile Ser Lys Glu Glu Met Val
20 25 30

Arg Arg Leu Lys Met Val Val Lys Thr Phe Met Asp Met Asp Gln Asp
35 40 45

Ser Glu Glu Glu Lys Glu Leu Tyr Leu Asn Leu Ala Leu His Leu Ala
50 55 60

Ser Asp Phe Phe Leu Lys His Pro Gly Lys Asp Val Arg Leu Leu Val
65 70 75 80

Ala Cys Cys Leu Ala Asp Ile Phe Arg Ile Tyr Ala Pro Glu Ala Pro
85 90 95

Tyr Thr Ser Pro Asp Lys Leu Lys Asp Ile Phe Met Phe Ile Thr Arg
100 105 110

Gln Leu Lys Gly Leu Glu Asp Thr Lys Ser Pro Gln Phe Asn Arg Tyr
115 120 125

Phe Tyr Leu Leu Glu Asn Ile Ala Trp Val Lys Ser Tyr Asn Ile Cys
130 135 140

Phe Glu Leu Glu Asp Ser Asn Glu Ile Phe Thr Gln Leu Tyr Arg Thr
145 150 155 160

Leu Phe Ser Val Ile Asn Asn Gly His Asn Gln Lys Val His Met His
165 170 175

Met Val Asp Leu Met Ser Ser Ile Ile Cys Glu Gly Asp Thr Val Ser
180 185 190

Gln Glu Leu Leu Asp Thr Val Leu Val Asn Leu Val Pro Ala His Lys
195 200 205

Asn Leu Asn Lys Gln Ala Tyr Asp Leu Ala Lys Ala Leu Leu Lys Arg
210 215 220

Thr Ala Gln Ala Ile Glu Pro Tyr Ile Thr Thr Phe Phe Asn Gln Val
225 230 235 240

Leu Met Leu Gly Lys Thr Ser Ile Ser Asp Leu Ser Glu His Val Phe
245 250 255

Asp Leu Ile Leu Glu Leu Tyr Asn Ile Asp Ser His Leu Leu Leu Ser
260 265 270

Val Leu Pro Gln Leu Glu Phe Lys Leu Lys Ser Asn Asp Asn Glu Glu
275 280 285

Arg Leu Gln Val Val Lys Leu Leu Ala Lys Met Phe Gly Ala Lys Asp
290 295 300

Ser Glu Leu Ala Ser Gln Asn Lys Pro Leu Trp Gln Cys Tyr Leu Gly
305 310 315 320

Arg Phe Asn Asp Ile His Val Pro Ile Arg Leu Glu Cys Val Lys Phe
325 330 335

Ala Ser His Cys Leu Met Asn His Pro Asp Leu Ala Lys Asp Leu Thr
340 345 350

Glu Tyr Leu Lys Val Arg Ser His Asp Pro Glu Glu Ala Ile Arg His
355 360 365

Asp Val Ile Val Ser Ile Val Thr Ala Ala Lys Lys Asp Ile Leu Leu
370 375 380

Val Asn Asp His Leu Leu Asn Phe Val Arg Glu Arg Thr Leu Asp Lys
385 390 395 400

Arg Trp Arg Val Arg Lys Glu Ala Met Met Gly Leu Ala Gln Ile Tyr
405 410 415

Lys Lys Tyr Ala Leu Gln Ser Ala Ala Gly Lys Asp Ala Ala Lys Gln
420 425 430

Ile Ala Trp Ile Lys Asp Lys Leu Leu His Ile Tyr Tyr Gln Asn Ser
435 440 445

Ile Asp Asp Arg Leu Leu Val Glu Arg Ile Phe Ala Gln Tyr Met Val
450 455 460

Pro His Asn Leu Glu Thr Thr Glu Arg Met Lys Cys Leu Tyr Tyr Leu
465 470 475 480

Tyr Ala Thr Leu Asp Leu Asn Ala Val Lys Ala Leu Asn Glu Met Trp
485 490 495

Lys Cys Gln Asn Leu Leu Arg His Gln Val Lys Asp Leu Leu Asp Leu
500 505 510

Ile Lys Gln Pro Lys Thr Asp Ala Ser Val Lys Ala Ile Phe Ser Lys
515 520 525

Val Met Val Ile Thr Arg Asn Leu Pro Asp Pro Gly Lys Ala Gln Asp
530 535 540

Phe Met Lys Lys Phe Thr Gln Val Leu Glu Asp Asp Glu Lys Ile Arg
545 550 555 560

Lys Gln Leu Glu Val Leu Val Ser Pro Thr Cys Ser Cys Lys Gln Ala
565 570 575

Glu Gly Cys Val Arg Glu Ile Thr Lys Lys Leu Gly Asn Pro Lys Gln
580 585 590

Pro Thr Asn Pro Phe Leu Glu Met Ile Lys Phe Leu Leu Glu Arg Ile
595 600 605

Ala Pro Val His Ile Asp Thr Glu Ser Ile Ser Ala Leu Ile Lys Gln
610 615 620

Val Asn Lys Ser Ile Asp Gly Thr Ala Asp Asp Glu Asp Glu Gly Val
625 630 635 640

Pro Thr Asp Gln Ala Ile Arg Ala Gly Leu Glu Leu Leu Lys Val Leu
645 650 655

Ser Phe Thr His Pro Ile Ser Phe His Ser Ala Glu Thr Phe Glu Ser
660 665 670

Leu Leu Ala Cys Leu Lys Met Asp Asp Glu Lys Val Ala Glu Ala Ala
675 680 685

Leu Gln Ile Phe Lys Asn Thr Gly Ser Lys Ile Glu Glu Asp Phe Pro
690 695 700

His Ile Arg Ser Ala Leu Leu Pro Val Leu His His Lys Ser Lys Lys
705 710 715 720

Gly Pro Pro Arg Gln Ala Lys Tyr Ala Ile His Cys Ile His Ala Ile
725 730 735

Phe Ser Ser Lys Glu Thr Gln Phe Ala Gln Ile Phe Glu Pro Leu His
740 745 750

Lys Ser Leu Asp Pro Ser Asn Leu Glu His Leu Ile Thr Pro Leu Val
755 760 765

Thr Ile Gly His Ile Ala Leu Leu Ala Pro Asp Gln Phe Ala Ala Pro
770 775 780

Trp Lys Ser Trp Val Ala Thr Phe Ile Val Lys Asp Leu Leu Met Asn
785 790 795 800

Asp Arg Leu Pro Gly Lys Lys Thr Thr Lys Leu Trp Val Pro Asp Glu
805 810 815

Glu Val Ser Pro Glu Thr Met Val Lys Ile Gln Ala Ile Lys Met Met
820 825 830

Val Arg Trp Leu Leu Gly Met Lys Asn Asn His Ser Lys Ser Gly Thr
835 840 845

Ser Thr Leu Arg Leu Leu Thr Thr Ile Leu His Ser Asp Gly Asp Leu
850 855 860

Thr Glu Gln Gly Lys Ile Ser Lys Pro Asp Met Ser Arg Leu Arg Leu
865 870 875 880

Ala Ala Gly Ser Ala Ile Val Lys Leu Ala Gln Glu Pro Cys Tyr His
885 890 895

Glu Ile Ile Thr Leu Glu Gln Tyr Gln Leu Cys Ala Leu Ala Ile Asn
900 905 910

Asp Glu Cys Tyr Gln Val Arg Gln Val Phe Ala Gln Lys Leu His Lys
915 920 925

Gly Leu Ser Arg Leu Arg Leu Pro Leu Glu Tyr Met Ala Ile Cys Ala
930 935 940

Leu Cys Ala Lys Asp Pro Val Lys Glu Arg Arg Ala His Ala Arg Gln
945 950 955 960

Cys Leu Val Lys Asn Ile Asn Val Arg Arg Glu Tyr Leu Lys Gln His
965 970 975

Ala Ala Val Ser Glu Lys Leu Leu Ser Leu Leu Pro Glu Tyr Val Val
980 985 990

Pro Tyr Thr Ile His Leu Leu Ala His Asp Pro Asp Tyr Val Lys Val
995 1000 1005

Gln Asp Ile Glu Gln Leu Lys Asp Val Lys Glu Cys Leu Trp Phe Val
1010 1015 1020

Leu Glu Ile Leu Met Ala Lys Asn Glu Asn Asn Ser His Ala Phe Ile
1025 1030 1035 1040

Arg Lys Met Val Glu Asn Ile Lys Gln Thr Lys Asp Ala Gln Gly Pro
1045 1050 1055

Asp Asp Ala Lys Met Asn Glu Lys Leu Tyr Thr Val Cys Asp Val Ala
1060 1065 1070

Met Asn Ile Ile Met Ser Lys Ser Thr Thr Tyr Ser Leu Glu Ser Pro
1075 1080 1085

Lys Asp Pro Val Leu Pro Ala Arg Phe Phe Thr Gln Pro Asp Lys Asn
1090 1095 1100

Phe Ser Asn Thr Lys Asn Tyr Leu Pro Pro Glu Met Lys Ser Phe Phe
1105 1110 1115 1120

Thr Pro Gly Lys Pro Lys Thr Thr Asn Val Leu Gly Ala Val Asn Lys
1125 1130 1135

Pro Leu Ser Ser Ala Gly Lys Gln Ser Gln Thr Lys Ser Ser Arg Met
1140 1145 1150

Glu Thr Val Ser Asn Ala Ser Ser Ser Ser Asn Pro Ser Ser Pro Gly
1155 1160 1165

Arg Ile Lys Gly Arg Leu Asp Ser Ser Glu Met Asp His Ser Glu Asn
1170 1175 1180

Glu Asp Tyr Thr Met Ser Ser Pro Leu Pro Gly Lys Lys Ser Asp Lys
1185 1190 1195 1200

Arg Asp Asp Ser Asp Leu Val Arg Ser Glu Leu Glu Lys Pro Arg Gly
1205 1210 1215

Arg Lys Lys Thr Pro Val Thr Glu Gln Glu Glu Lys Leu Gly Met Asp
1220 1225 1230

Asp Leu Thr Lys Leu Val Gln Glu Gln Lys Pro Lys Gly Ser Gln Arg
1235 1240 1245

Ser Arg Lys Arg Gly His Thr Ala Ser Glu Ser Asp Glu Gln Gln Trp
1250 1255 1260

Pro Glu Glu Lys Arg Leu Lys Glu Asp Ile Leu Glu Asn Glu Asp Glu
1265 1270 1275 1280

Gln Asn Ser Pro Pro Lys Lys Gly Lys Arg Gly Arg Pro Pro Lys Pro
1285 1290 1295

Leu Gly Gly Gly Thr Pro Lys Glu Glu Pro Thr Met Lys Thr Ser Lys
1300 1305 1310

Lys Gly Ser Lys Lys Lys Ser Gly Pro Pro Ala Pro Glu Glu Glu Glu
1315 1320 1325

Glu Glu Glu Arg Gln Ser Gly Asn Thr Glu Gln Lys Ser Lys Ser Lys
1330 1335 1340

Gln His Arg Val Ser Arg Arg Ala Gln Gln Arg Ala Glu Ser Pro Glu
1345 1350 1355 1360

Ser Ser Ala Ile Glu Ser Thr Gln Ser Thr Pro Gln Lys Gly Arg Gly
1365 1370 1375

Arg Pro Ser Lys Thr Pro Ser Pro Ser Gln Pro Lys Lys Asn Val
1380 1385 1390



<210> 3 

<211> 4173 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (4173) 

<400> 3 

atg gct cat tca aag act agg acc aat gat gga aaa att aca tat ccg 48
Met Ala His Ser Lys Thr Arg Thr Asn Asp Gly Lys Ile Thr Tyr Pro
1 5 10 15

cct ggg gtc aag gaa ata tca gat aaa ata tct aaa gag gag atg gtg 96
Pro Gly Val Lys Glu Ile Ser Asp Lys Ile Ser Lys Glu Glu Met Val
20 25 30

aga cga tta aag atg gtt gtg aaa act ttt atg gat atg gac cag gac 144
Arg Arg Leu Lys Met Val Val Lys Thr Phe Met Asp Met Asp Gln Asp
35 40 45

tct gaa gaa gaa aag gag ctt tat tta aac cta gct tta cat ctt gct 192
Ser Glu Glu Glu Lys Glu Leu Tyr Leu Asn Leu Ala Leu His Leu Ala
50 55 60

tca gat ttt ttt ctc aag cat cct ggt aaa gat gtt cgc tta ctg gta 240
Ser Asp Phe Phe Leu Lys His Pro Gly Lys Asp Val Arg Leu Leu Val
65 70 75 80

gcc tgc tgc ctt gct gat att ttc agg att tat gct cct gaa gct cct 288
Ala Cys Cys Leu Ala Asp Ile Phe Arg Ile Tyr Ala Pro Glu Ala Pro
85 90 95

tac aca tcc cct gat aaa cta aag gat ata ttt atg ttt ata aca aga 336
Tyr Thr Ser Pro Asp Lys Leu Lys Asp Ile Phe Met Phe Ile Thr Arg
100 105 110

cag ttg aag ggg cta gag gat aca aag agc cca caa ttc aat agg tat 384
Gln Leu Lys Gly Leu Glu Asp Thr Lys Ser Pro Gln Phe Asn Arg Tyr
115 120 125

ttt tat tta ctt gag aac att gct tgg gtc aag tca tat aac ata tgc 432
Phe Tyr Leu Leu Glu Asn Ile Ala Trp Val Lys Ser Tyr Asn Ile Cys
130 135 140

ttt gag tta gaa gat agc aat gaa att ttc acc cag cta tac aga acc 480
Phe Glu Leu Glu Asp Ser Asn Glu Ile Phe Thr Gln Leu Tyr Arg Thr
145 150 155 160

tta ttt tca gtt ata aac aat ggc cac aat cag aaa gtc cat atg cac 528
Leu Phe Ser Val Ile Asn Asn Gly His Asn Gln Lys Val His Met His
165 170 175

atg gta gac ctt atg agc tct att att tgt gaa ggt gat aca gtg tct 576
Met Val Asp Leu Met Ser Ser Ile Ile Cys Glu Gly Asp Thr Val Ser
180 185 190

cag gag ctt ttg gat acg gtt tta gta aat ctg gta cct gct cat aag 624
Gln Glu Leu Leu Asp Thr Val Leu Val Asn Leu Val Pro Ala His Lys
195 200 205

aat tta aac aag caa gca tat gat ttg gca aag gct tta ctg aag agg 672
Asn Leu Asn Lys Gln Ala Tyr Asp Leu Ala Lys Ala Leu Leu Lys Arg
210 215 220

aca gct caa gct att gag cca tat att acc act ttt ttt aat cag gtt 720
Thr Ala Gln Ala Ile Glu Pro Tyr Ile Thr Thr Phe Phe Asn Gln Val
225 230 235 240

ctg atg ctt ggg aaa aca tct atc agc gat ttg tca gag cat gtc ttt 768
Leu Met Leu Gly Lys Thr Ser Ile Ser Asp Leu Ser Glu His Val Phe
245 250 255

gac tta att ttg gag ctc tac aat att gat agt cat ttg ctg ctc tct 816
Asp Leu Ile Leu Glu Leu Tyr Asn Ile Asp Ser His Leu Leu Leu Ser
260 265 270

gtt tta ccc cag ctt gaa ttt aaa tta aag agc aat gat aat gag gag 864
Val Leu Pro Gln Leu Glu Phe Lys Leu Lys Ser Asn Asp Asn Glu Glu
275 280 285

cgc cta caa gtt gtt aaa cta ctg gca aaa atg ttt ggg gca aag gat 912
Arg Leu Gln Val Val Lys Leu Leu Ala Lys Met Phe Gly Ala Lys Asp
290 295 300

tca gaa ttg gct tct caa aac aag cca ctt tgg cag tgc tac ttg ggc 960
Ser Glu Leu Ala Ser Gln Asn Lys Pro Leu Trp Gln Cys Tyr Leu Gly
305 310 315 320

agg ttt aat gat atc cat gta cca atc cgc ctg gaa tgt gtg aaa ttt 1008
Arg Phe Asn Asp Ile His Val Pro Ile Arg Leu Glu Cys Val Lys Phe
325 330 335

gct agc cat tgt ctc atg aac cat cct gat tta gca aaa gac tta aca 1056
Ala Ser His Cys Leu Met Asn His Pro Asp Leu Ala Lys Asp Leu Thr
340 345 350

gag tat ctt aaa gtg agg tca cat gac cct gag gaa gct att aga cat 1104
Glu Tyr Leu Lys Val Arg Ser His Asp Pro Glu Glu Ala Ile Arg His
355 360 365

gat gtt att gtg tca ata gtt aca gct gct aaa aag gat att ctt ctg 1152
Asp Val Ile Val Ser Ile Val Thr Ala Ala Lys Lys Asp Ile Leu Leu
370 375 380

gtc aat gat cac tta ctt aat ttt gtg aga gag aga aca tta gac aaa 1200
Val Asn Asp His Leu Leu Asn Phe Val Arg Glu Arg Thr Leu Asp Lys
385 390 395 400

cga tgg aga gta cgc aaa gaa gcc atg atg gga ctt gcc caa att tat 1248
Arg Trp Arg Val Arg Lys Glu Ala Met Met Gly Leu Ala Gln Ile Tyr
405 410 415

aag aaa tat gct tta cag tca gca gct gga aaa gat gct gca aaa cag 1296
Lys Lys Tyr Ala Leu Gln Ser Ala Ala Gly Lys Asp Ala Ala Lys Gln
420 425 430

ata gca tgg atc aaa gac aaa ttg cta cat ata tat tat caa aat agt 1344
Ile Ala Trp Ile Lys Asp Lys Leu Leu His Ile Tyr Tyr Gln Asn Ser
435 440 445

att gat gat cga cta ctt gtt gaa cgg atc ttt gct caa tac atg gtt 1392
Ile Asp Asp Arg Leu Leu Val Glu Arg Ile Phe Ala Gln Tyr Met Val
450 455 460

cct cac aat tta gaa act aca gaa cgg atg aaa tgc tta tat tac ttg 1440
Pro His Asn Leu Glu Thr Thr Glu Arg Met Lys Cys Leu Tyr Tyr Leu
465 470 475 480

tat gcc aca ctg gat tta aat gct gtg aaa gca ttg aat gaa atg tgg 1488
Tyr Ala Thr Leu Asp Leu Asn Ala Val Lys Ala Leu Asn Glu Met Trp
485 490 495

aaa tgt caa aat ctg ctc cga cat caa gta aag gat ttg ctt gac ttg 1536
Lys Cys Gln Asn Leu Leu Arg His Gln Val Lys Asp Leu Leu Asp Leu
500 505 510

att aag caa ccc aaa aca gat gcc agt gtc aag gcc ata ttt tca aaa 1584
Ile Lys Gln Pro Lys Thr Asp Ala Ser Val Lys Ala Ile Phe Ser Lys
515 520 525

gtg atg gtt att aca aga aat tta cct gat cct ggt aag gct cag gat 1632
Val Met Val Ile Thr Arg Asn Leu Pro Asp Pro Gly Lys Ala Gln Asp
530 535 540

ttc atg aag aaa ttc aca cag gtg tta gaa gat gat gag aaa ata aga 1680
Phe Met Lys Lys Phe Thr Gln Val Leu Glu Asp Asp Glu Lys Ile Arg
545 550 555 560

aag cag tta gaa gta ctt gtt agt cca aca tgc tcc tgc aag cag gct 1728
Lys Gln Leu Glu Val Leu Val Ser Pro Thr Cys Ser Cys Lys Gln Ala
565 570 575

gaa ggt tgt gtg cgt gaa ata act aag aag ttg ggc aac ccc aaa cag 1776
Glu Gly Cys Val Arg Glu Ile Thr Lys Lys Leu Gly Asn Pro Lys Gln
580 585 590

cct aca aat cct ttc ctg gaa atg atc aag ttt ctc ttg gag agg ata 1824
Pro Thr Asn Pro Phe Leu Glu Met Ile Lys Phe Leu Leu Glu Arg Ile
595 600 605

gca cct gtg cac ata gat acc gaa tct atc agt gct ctt att aaa caa 1872
Ala Pro Val His Ile Asp Thr Glu Ser Ile Ser Ala Leu Ile Lys Gln
610 615 620

gtg aac aaa tca ata gat gga aca gca gat gat gaa gat gag ggt gtt 1920
Val Asn Lys Ser Ile Asp Gly Thr Ala Asp Asp Glu Asp Glu Gly Val
625 630 635 640

cca act gat caa gcc atc aga gca ggt ctt gaa ctg ctt aag gta ctc 1968
Pro Thr Asp Gln Ala Ile Arg Ala Gly Leu Glu Leu Leu Lys Val Leu
645 650 655

tca ttt aca cat ccc atc tca ttt cat tct gct gaa aca ttt gaa tca 2016
Ser Phe Thr His Pro Ile Ser Phe His Ser Ala Glu Thr Phe Glu Ser
660 665 670

tta ctg gct tgt ctg aaa atg gat gat gaa aaa gta gca gaa gct gca 2064
Leu Leu Ala Cys Leu Lys Met Asp Asp Glu Lys Val Ala Glu Ala Ala
675 680 685

cta caa att ttc aaa aac aca gga agc aaa att gaa gag gat ttt cca 2112
Leu Gln Ile Phe Lys Asn Thr Gly Ser Lys Ile Glu Glu Asp Phe Pro
690 695 700

cac atc aga tca gcc ttg ctt cct gtt tta cat cac aaa tct aaa aaa 2160
His Ile Arg Ser Ala Leu Leu Pro Val Leu His His Lys Ser Lys Lys
705 710 715 720

gga ccc ccc cgt caa gcc aaa tat gcc att cat tgt atc cat gcg ata 2208
Gly Pro Pro Arg Gln Ala Lys Tyr Ala Ile His Cys Ile His Ala Ile
725 730 735

ttt tct agt aaa gag acc cag ttt gca cag ata ttt gag cct ctg cat 2256
Phe Ser Ser Lys Glu Thr Gln Phe Ala Gln Ile Phe Glu Pro Leu His
740 745 750

aag agc cta gat cca agc aac ctg gaa cat ctc ata aca cca ttg gtt 2304
Lys Ser Leu Asp Pro Ser Asn Leu Glu His Leu Ile Thr Pro Leu Val
755 760 765

act att ggt cat att gct ctc ctt gca cct gat caa ttt gct gct cct 2352
Thr Ile Gly His Ile Ala Leu Leu Ala Pro Asp Gln Phe Ala Ala Pro
770 775 780

tgg aaa tct tgg gta gct act ttc att gtg aaa gat ctt ctc atg aat 2400
Trp Lys Ser Trp Val Ala Thr Phe Ile Val Lys Asp Leu Leu Met Asn
785 790 795 800

gat cgg ctt cca ggg aaa aag aca act aaa ctt tgg gtt cca gat gaa 2448
Asp Arg Leu Pro Gly Lys Lys Thr Thr Lys Leu Trp Val Pro Asp Glu
805 810 815 
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gaa gta tct cct gag aca atg gtc aaa att cag get att aaa atg atg 2496 
Glu Val Ser Pro Glu Thr Met Val Lys lie Gin Ala lie Lys Met Met 
820 825 830 

gtt cga tgg cta ctt gga atg aaa aat aat cac agt aaa tca gga act 2544
Val Arg Trp Leu Leu Gly Met Lys Asn Asn His Ser Lys Ser Gly Thr
835 840 845

tct acc tta aga ttg cta aca aca ata ttg cat agt gat gga gac ttg 2592
Ser Thr Leu Arg Leu Leu Thr Thr Ile Leu His Ser Asp Gly Asp Leu
850 855 860

aca gaa cag ggg aaa att agt aaa cca gat atg tca cgt ctg aga ctt 2640
Thr Glu Gln Gly Lys Ile Ser Lys Pro Asp Met Ser Arg Leu Arg Leu
865 870 875 880

gct gct ggg agt gct att gtg aag ctg gca caa gaa ccc tgt tac cat 2688
Ala Ala Gly Ser Ala Ile Val Lys Leu Ala Gln Glu Pro Cys Tyr His
885 890 895

gaa atc atc aca tta gaa caa tat cag cta tgt gca tta gct atc aac 2736
Glu Ile Ile Thr Leu Glu Gln Tyr Gln Leu Cys Ala Leu Ala Ile Asn
900 905 910

gat gaa tgc tat caa gta aga caa gtg ttt gcc cag aaa ctt cac aaa 2784
Asp Glu Cys Tyr Gln Val Arg Gln Val Phe Ala Gln Lys Leu His Lys
915 920 925

ggc ctt tcc cgt tta cgg ctt cca ctt gag tat atg gca atc tgt gcc 2832
Gly Leu Ser Arg Leu Arg Leu Pro Leu Glu Tyr Met Ala Ile Cys Ala
930 935 940

ctt tgt gca aaa gat cct gta aag gag aga aga gct cat gct agg caa 2880
Leu Cys Ala Lys Asp Pro Val Lys Glu Arg Arg Ala His Ala Arg Gln
945 950 955 960

tgt ttg gtg aaa aat ata aat gta agg cgg gag tat ctg aag cag cat 2928
Cys Leu Val Lys Asn Ile Asn Val Arg Arg Glu Tyr Leu Lys Gln His
965 970 975

gca gct gtt agt gaa aaa tta ttg tct ctt cta cca gag tat gtt gtt 2976
Ala Ala Val Ser Glu Lys Leu Leu Ser Leu Leu Pro Glu Tyr Val Val
980 985 990

cca tat aca att cac ctt ttg gca cat gac cca gat tat gtc aaa gta 3024
Pro Tyr Thr Ile His Leu Leu Ala His Asp Pro Asp Tyr Val Lys Val
995 1000 1005

cag gat att gaa caa ctt aaa gat gtt aaa gaa tgt ctt tgg ttt gtt 3072
Gln Asp Ile Glu Gln Leu Lys Asp Val Lys Glu Cys Leu Trp Phe Val
1010 1015 1020

ctg gaa ata tta atg gct aaa aat gaa aat aac agt cac gct ttt atc 3120
Leu Glu Ile Leu Met Ala Lys Asn Glu Asn Asn Ser His Ala Phe Ile
1025 1030 1035 1040

aga aag atg gta gaa aat att aaa caa aca aaa gat gcc caa gga cca 3168
Arg Lys Met Val Glu Asn Ile Lys Gln Thr Lys Asp Ala Gln Gly Pro
1045 1050 1055

gat gat gca aaa atg aat gaa aaa ctg tac act gtg tgt gat gtt gcc 3216
Asp Asp Ala Lys Met Asn Glu Lys Leu Tyr Thr Val Cys Asp Val Ala
1060 1065 1070

atg aat atc atc atg tca aag agt act aca tac agt ttg gaa tct cct 3264
Met Asn Ile Ile Met Ser Lys Ser Thr Thr Tyr Ser Leu Glu Ser Pro
1075 1080 1085

aaa gac ccg gta cta cca gct cgt ttc ttc act caa cct gac aag aat 3312
Lys Asp Pro Val Leu Pro Ala Arg Phe Phe Thr Gln Pro Asp Lys Asn
1090 1095 1100

ttc agt aac acc aaa aat tat ctg cct cct gaa atg aaa tca ttt ttc 3360
Phe Ser Asn Thr Lys Asn Tyr Leu Pro Pro Glu Met Lys Ser Phe Phe
1105 1110 1115 1120

act cct gga aaa cct aaa aca acc aat gtt cta gga gct gtt aac aag 3408
Thr Pro Gly Lys Pro Lys Thr Thr Asn Val Leu Gly Ala Val Asn Lys
1125 1130 1135

cca ctt tca tca gca ggc aag caa tct cag acc aaa tca tca cga atg 3456
Pro Leu Ser Ser Ala Gly Lys Gln Ser Gln Thr Lys Ser Ser Arg Met
1140 1145 1150

gaa act gta agc aat gca agc agc agc tca aat cca agc tct cct gga 3504
Glu Thr Val Ser Asn Ala Ser Ser Ser Ser Asn Pro Ser Ser Pro Gly
1155 1160 1165

aga ata aag ggg agg ctt gat agt tct gaa atg gat cac agt gaa aat 3552
Arg Ile Lys Gly Arg Leu Asp Ser Ser Glu Met Asp His Ser Glu Asn
1170 1175 1180

gaa gat tac aca atg tct tca cct ttg ccg ggg aaa aaa agt gac aag 3600
Glu Asp Tyr Thr Met Ser Ser Pro Leu Pro Gly Lys Lys Ser Asp Lys
1185 1190 1195 1200

aga gac gac tct gat ctt gta agg tct gaa ttg gag aag cct aga ggc 3648
Arg Asp Asp Ser Asp Leu Val Arg Ser Glu Leu Glu Lys Pro Arg Gly
1205 1210 1215

agg aaa aaa acg ccc gtc aca gaa cag gag gag aaa tta ggt atg gat 3696
Arg Lys Lys Thr Pro Val Thr Glu Gln Glu Glu Lys Leu Gly Met Asp
1220 1225 1230

gac ttg act aag ttg gta cag gaa cag aaa cct aaa ggc agt cag cga 3744
Asp Leu Thr Lys Leu Val Gln Glu Gln Lys Pro Lys Gly Ser Gln Arg
1235 1240 1245

agt cgg aaa aga ggc cat acg gct tca gaa tct gat gaa cag cag tgg 3792
Ser Arg Lys Arg Gly His Thr Ala Ser Glu Ser Asp Glu Gln Gln Trp
1250 1255 1260

cct gag gaa aag agg ctc aaa gaa gat ata tta gaa aat gaa gat gaa 3840
Pro Glu Glu Lys Arg Leu Lys Glu Asp Ile Leu Glu Asn Glu Asp Glu
1265 1270 1275 1280

cag aat agt ccg cca aaa aag ggt aaa aga ggc cga cca cca aaa cct 3888
Gln Asn Ser Pro Pro Lys Lys Gly Lys Arg Gly Arg Pro Pro Lys Pro
1285 1290 1295

ctt ggt gga ggt aca cca aaa gaa gag cca aca atg aaa act tct aaa 3936
Leu Gly Gly Gly Thr Pro Lys Glu Glu Pro Thr Met Lys Thr Ser Lys
1300 1305 1310

aaa gga agc aaa aaa aaa tct gga cct cca gca cca gag gag gag gaa 3984
Lys Gly Ser Lys Lys Lys Ser Gly Pro Pro Ala Pro Glu Glu Glu Glu
1315 1320 1325

gaa gaa gaa aga caa agt gga aat acg gaa cag aag tcc aaa agc aaa 4032
Glu Glu Glu Arg Gln Ser Gly Asn Thr Glu Gln Lys Ser Lys Ser Lys
1330 1335 1340

cag cac cga gtg tca agg aga gca cag cag aga gca gaa tct cct gaa 4080
Gln His Arg Val Ser Arg Arg Ala Gln Gln Arg Ala Glu Ser Pro Glu
1345 1350 1355 1360

tct agt gca att gaa tcc aca cag tcc aca cca cag aaa gga cga gga 4128
Ser Ser Ala Ile Glu Ser Thr Gln Ser Thr Pro Gln Lys Gly Arg Gly
1365 1370 1375

aga cca tca aaa acg cca tca cca tca caa cca aaa aaa aat gtg 4173
Arg Pro Ser Lys Thr Pro Ser Pro Ser Gln Pro Lys Lys Asn Val
1380 1385 1390



<210> 4
<211> 5355
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (150)..(4322)

<400> 4

cggagaggag gaggaacggc agggctggct gcggaagggg aggggggggg agaaggcgat 60

tggatgcggc ggcggcggcg gatcccggag agccccggag tgagcggagt agcgagtcgg 120

caacccggag gggtagaaat atttctgtc atg gct cat tca aag act agg acc 173
Met Ala His Ser Lys Thr Arg Thr
1 5

aat gat gga aaa att aca tat ccg cct ggg gtc aag gaa ata tca gat 221
Asn Asp Gly Lys Ile Thr Tyr Pro Pro Gly Val Lys Glu Ile Ser Asp
10 15 20

aaa ata tct aaa gag gag atg gtg aga cga tta aag atg gtt gtg aaa 269
Lys Ile Ser Lys Glu Glu Met Val Arg Arg Leu Lys Met Val Val Lys
25 30 35 40

act ttt atg gat atg gac cag gac tct gaa gaa gaa aag gag ctt tat 317
Thr Phe Met Asp Met Asp Gln Asp Ser Glu Glu Glu Lys Glu Leu Tyr
45 50 55

tta aac cta gct tta cat ctt gct tca gat ttt ttt ctc aag cat cct 365
Leu Asn Leu Ala Leu His Leu Ala Ser Asp Phe Phe Leu Lys His Pro
60 65 70

ggt aaa gat gtt cgc tta ctg gta gcc tgc tgc ctt gct gat att ttc 413
Gly Lys Asp Val Arg Leu Leu Val Ala Cys Cys Leu Ala Asp Ile Phe
75 80 85

agg att tat gct cct gaa gct cct tac aca tcc cct gat aaa cta aag 461
Arg Ile Tyr Ala Pro Glu Ala Pro Tyr Thr Ser Pro Asp Lys Leu Lys
90 95 100

gat ata ttt atg ttt ata aca aga cag ttg aag ggg cta gag gat aca 509
Asp Ile Phe Met Phe Ile Thr Arg Gln Leu Lys Gly Leu Glu Asp Thr
105 110 115 120

aag agc cca caa ttc aat agg tat ttt tat tta ctt gag aac att gct 557
Lys Ser Pro Gln Phe Asn Arg Tyr Phe Tyr Leu Leu Glu Asn Ile Ala
125 130 135



tgg gtc aag tca tat aac ata tgc ttt gag tta gaa gat agc aat gaa 605
Trp Val Lys Ser Tyr Asn Ile Cys Phe Glu Leu Glu Asp Ser Asn Glu
140 145 150

att ttc acc cag cta tac aga acc tta ttt tca gtt ata aac aat ggc 653
Ile Phe Thr Gln Leu Tyr Arg Thr Leu Phe Ser Val Ile Asn Asn Gly
155 160 165

cac aat cag aaa gtc cat atg cac atg gta gac ctt atg agc tct att 701
His Asn Gln Lys Val His Met His Met Val Asp Leu Met Ser Ser Ile
170 175 180

att tgt gaa ggt gat aca gtg tct cag gag ctt ttg gat acg gtt tta 749
Ile Cys Glu Gly Asp Thr Val Ser Gln Glu Leu Leu Asp Thr Val Leu
185 190 195 200

gta aat ctg gta cct gct cat aag aat tta aac aag caa gca tat gat 797
Val Asn Leu Val Pro Ala His Lys Asn Leu Asn Lys Gln Ala Tyr Asp
205 210 215

ttg gca aag gct tta ctg aag agg aca gct caa gct att gag cca tat 845
Leu Ala Lys Ala Leu Leu Lys Arg Thr Ala Gln Ala Ile Glu Pro Tyr
220 225 230

att acc act ttt ttt aat cag gtt ctg atg ctt ggg aaa aca tct atc 893
Ile Thr Thr Phe Phe Asn Gln Val Leu Met Leu Gly Lys Thr Ser Ile
235 240 245

agc gat ttg tca gag cat gtc ttt gac tta att ttg gag ctc tac aat 941
Ser Asp Leu Ser Glu His Val Phe Asp Leu Ile Leu Glu Leu Tyr Asn
250 255 260

att gat agt cat ttg ctg ctc tct gtt tta ccc cag ctt gaa ttt aaa 989
Ile Asp Ser His Leu Leu Leu Ser Val Leu Pro Gln Leu Glu Phe Lys
265 270 275 280

tta aag agc aat gat aat gag gag cgc cta caa gtt gtt aaa cta ctg 1037
Leu Lys Ser Asn Asp Asn Glu Glu Arg Leu Gln Val Val Lys Leu Leu
285 290 295

gca aaa atg ttt ggg gca aag gat tca gaa ttg gct tct caa aac aag 1085
Ala Lys Met Phe Gly Ala Lys Asp Ser Glu Leu Ala Ser Gln Asn Lys
300 305 310

cca ctt tgg cag tgc tac ttg ggc agg ttt aat gat atc cat gta cca 1133
Pro Leu Trp Gln Cys Tyr Leu Gly Arg Phe Asn Asp Ile His Val Pro
315 320 325

atc cgc ctg gaa tgt gtg aaa ttt gct agc cat tgt ctc atg aac cat 1181
Ile Arg Leu Glu Cys Val Lys Phe Ala Ser His Cys Leu Met Asn His
330 335 340

cct gat tta gca aaa gac tta aca gag tat ctt aaa gtg agg tca cat 1229
Pro Asp Leu Ala Lys Asp Leu Thr Glu Tyr Leu Lys Val Arg Ser His
345 350 355 360

gac cct gag gaa gct att aga cat gat gtt att gtg tca ata gtt aca 1277
Asp Pro Glu Glu Ala Ile Arg His Asp Val Ile Val Ser Ile Val Thr
365 370 375

gct gct aaa aag gat att ctt ctg gtc aat gat cac tta ctt aat ttt 1325
Ala Ala Lys Lys Asp Ile Leu Leu Val Asn Asp His Leu Leu Asn Phe
380 385 390



gtg aga gag aga aca tta gac aaa cga tgg aga gta cgc aaa gaa gcc 1373
Val Arg Glu Arg Thr Leu Asp Lys Arg Trp Arg Val Arg Lys Glu Ala
395 400 405

atg atg gga ctt gcc caa att tat aag aaa tat gct tta cag tca gca 1421
Met Met Gly Leu Ala Gln Ile Tyr Lys Lys Tyr Ala Leu Gln Ser Ala
410 415 420

gct gga aaa gat gct gca aaa cag ata gca tgg atc aaa gac aaa ttg 1469
Ala Gly Lys Asp Ala Ala Lys Gln Ile Ala Trp Ile Lys Asp Lys Leu
425 430 435 440

cta cat ata tat tat caa aat agt att gat gat cga cta ctt gtt gaa 1517
Leu His Ile Tyr Tyr Gln Asn Ser Ile Asp Asp Arg Leu Leu Val Glu
445 450 455

cgg atc ttt gct caa tac atg gtt cct cac aat tta gaa act aca gaa 1565
Arg Ile Phe Ala Gln Tyr Met Val Pro His Asn Leu Glu Thr Thr Glu
460 465 470

cgg atg aaa tgc tta tat tac ttg tat gcc aca ctg gat tta aat gct 1613
Arg Met Lys Cys Leu Tyr Tyr Leu Tyr Ala Thr Leu Asp Leu Asn Ala
475 480 485

gtg aaa gca ttg aat gaa atg tgg aaa tgt caa aat ctg ctc cga cat 1661
Val Lys Ala Leu Asn Glu Met Trp Lys Cys Gln Asn Leu Leu Arg His
490 495 500

caa gta aag gat ttg ctt gac ttg att aag caa ccc aaa aca gat gcc 1709
Gln Val Lys Asp Leu Leu Asp Leu Ile Lys Gln Pro Lys Thr Asp Ala
505 510 515 520

agt gtc aag gcc ata ttt tca aaa gtg atg gtt att aca aga aat tta 1757
Ser Val Lys Ala Ile Phe Ser Lys Val Met Val Ile Thr Arg Asn Leu
525 530 535

cct gat cct ggt aag gct cag gat ttc atg aag aaa ttc aca cag gtg 1805
Pro Asp Pro Gly Lys Ala Gln Asp Phe Met Lys Lys Phe Thr Gln Val
540 545 550

tta gaa gat gat gag aaa ata aga aag cag tta gaa gta ctt gtt agt 1853
Leu Glu Asp Asp Glu Lys Ile Arg Lys Gln Leu Glu Val Leu Val Ser
555 560 565

cca aca tgc tcc tgc aag cag gct gaa ggt tgt gtg cgt gaa ata act 1901
Pro Thr Cys Ser Cys Lys Gln Ala Glu Gly Cys Val Arg Glu Ile Thr
570 575 580

aag aag ttg ggc aac ccc aaa cag cct aca aat cct ttc ctg gaa atg 1949
Lys Lys Leu Gly Asn Pro Lys Gln Pro Thr Asn Pro Phe Leu Glu Met
585 590 595 600

atc aag ttt ctc ttg gag agg ata gca cct gtg cac ata gat acc gaa 1997
Ile Lys Phe Leu Leu Glu Arg Ile Ala Pro Val His Ile Asp Thr Glu
605 610 615

tct atc agt gct ctt att aaa caa gtg aac aaa tca ata gat gga aca 2045
Ser Ile Ser Ala Leu Ile Lys Gln Val Asn Lys Ser Ile Asp Gly Thr
620 625 630

gca gat gat gaa gat gag ggt gtt cca act gat caa gcc atc aga gca 2093
Ala Asp Asp Glu Asp Glu Gly Val Pro Thr Asp Gln Ala Ile Arg Ala
635 640 645

ggt ctt gaa ctg ctt aag gta ctc tca ttt aca cat ccc atc tca ttt 2141
Gly Leu Glu Leu Leu Lys Val Leu Ser Phe Thr His Pro Ile Ser Phe
650 655 660

cat tct gct gaa aca ttt gaa tca tta ctg gct tgt ctg aaa atg gat 2189
His Ser Ala Glu Thr Phe Glu Ser Leu Leu Ala Cys Leu Lys Met Asp
665 670 675 680

gat gaa aaa gta gca gaa gct gca cta caa att ttc aaa aac aca gga 2237
Asp Glu Lys Val Ala Glu Ala Ala Leu Gln Ile Phe Lys Asn Thr Gly
685 690 695

agc aaa att gaa gag gat ttt cca cac atc aga tca gcc ttg ctt cct 2285
Ser Lys Ile Glu Glu Asp Phe Pro His Ile Arg Ser Ala Leu Leu Pro
700 705 710

gtt tta cat cac aaa tct aaa aaa gga ccc ccc cgt caa gcc aaa tat 2333
Val Leu His His Lys Ser Lys Lys Gly Pro Pro Arg Gln Ala Lys Tyr
715 720 725

gcc att cat tgt atc cat gcg ata ttt tct agt aaa gag acc cag ttt 2381
Ala Ile His Cys Ile His Ala Ile Phe Ser Ser Lys Glu Thr Gln Phe
730 735 740

gca cag ata ttt gag cct ctg cat aag agc cta gat cca agc aac ctg 2429
Ala Gln Ile Phe Glu Pro Leu His Lys Ser Leu Asp Pro Ser Asn Leu
745 750 755 760

gaa cat ctc ata aca cca ttg gtt act att ggt cat att gct ctc ctt 2477
Glu His Leu Ile Thr Pro Leu Val Thr Ile Gly His Ile Ala Leu Leu
765 770 775

gca cct gat caa ttt gct gct cct tgg aaa tct tgg gta gct act ttc 2525
Ala Pro Asp Gln Phe Ala Ala Pro Trp Lys Ser Trp Val Ala Thr Phe
780 785 790

att gtg aaa gat ctt ctc atg aat gat cgg ctt cca ggg aaa aag aca 2573
Ile Val Lys Asp Leu Leu Met Asn Asp Arg Leu Pro Gly Lys Lys Thr
795 800 805

act aaa ctt tgg gtt cca gat gaa gaa gta tct cct gag aca atg gtc 2621
Thr Lys Leu Trp Val Pro Asp Glu Glu Val Ser Pro Glu Thr Met Val
810 815 820

aaa att cag gct att aaa atg atg gtt cga tgg cta ctt gga atg aaa 2669
Lys Ile Gln Ala Ile Lys Met Met Val Arg Trp Leu Leu Gly Met Lys
825 830 835 840

aat aat cac agt aaa tca gga act tct acc tta aga ttg cta aca aca 2717
Asn Asn His Ser Lys Ser Gly Thr Ser Thr Leu Arg Leu Leu Thr Thr
845 850 855

ata ttg cat agt gat gga gac ttg aca gaa cag ggg aaa att agt aaa 2765
Ile Leu His Ser Asp Gly Asp Leu Thr Glu Gln Gly Lys Ile Ser Lys
860 865 870

cca gat atg tca cgt ctg aga ctt gct gct ggg agt gct att gtg aag 2813
Pro Asp Met Ser Arg Leu Arg Leu Ala Ala Gly Ser Ala Ile Val Lys
875 880 885

ctg gca caa gaa ccc tgt tac cat gaa atc atc aca tta gaa caa tat 2861
Leu Ala Gln Glu Pro Cys Tyr His Glu Ile Ile Thr Leu Glu Gln Tyr
890 895 900



cag cta tgt gca tta gct atc aac gat gaa tgc tat caa gta aga caa 2909
Gln Leu Cys Ala Leu Ala Ile Asn Asp Glu Cys Tyr Gln Val Arg Gln
905 910 915 920

gtg ttt gcc cag aaa ctt cac aaa ggc ctt tcc cgt tta cgg ctt cca 2957
Val Phe Ala Gln Lys Leu His Lys Gly Leu Ser Arg Leu Arg Leu Pro
925 930 935

ctt gag tat atg gca atc tgt gcc ctt tgt gca aaa gat cct gta aag 3005
Leu Glu Tyr Met Ala Ile Cys Ala Leu Cys Ala Lys Asp Pro Val Lys
940 945 950

gag aga aga gct cat gct agg caa tgt ttg gtg aaa aat ata aat gta 3053
Glu Arg Arg Ala His Ala Arg Gln Cys Leu Val Lys Asn Ile Asn Val
955 960 965

agg cgg gag tat ctg aag cag cat gca gct gtt agt gaa aaa tta ttg 3101
Arg Arg Glu Tyr Leu Lys Gln His Ala Ala Val Ser Glu Lys Leu Leu
970 975 980

tct ctt cta cca gag tat gtt gtt cca tat aca att cac ctt ttg gca 3149
Ser Leu Leu Pro Glu Tyr Val Val Pro Tyr Thr Ile His Leu Leu Ala
985 990 995 1000

cat gac cca gat tat gtc aaa gta cag gat att gaa caa ctt aaa gat 3197
His Asp Pro Asp Tyr Val Lys Val Gln Asp Ile Glu Gln Leu Lys Asp
1005 1010 1015

gtt aaa gaa tgt ctt tgg ttt gtt ctg gaa ata tta atg gct aaa aat 3245
Val Lys Glu Cys Leu Trp Phe Val Leu Glu Ile Leu Met Ala Lys Asn
1020 1025 1030

gaa aat aac agt cac gct ttt atc aga aag atg gta gaa aat att aaa 3293
Glu Asn Asn Ser His Ala Phe Ile Arg Lys Met Val Glu Asn Ile Lys
1035 1040 1045

caa aca aaa gat gcc caa gga cca gat gat gca aaa atg aat gaa aaa 3341
Gln Thr Lys Asp Ala Gln Gly Pro Asp Asp Ala Lys Met Asn Glu Lys
1050 1055 1060

ctg tac act gtg tgt gat gtt gcc atg aat atc atc atg tca aag agt 3389
Leu Tyr Thr Val Cys Asp Val Ala Met Asn Ile Ile Met Ser Lys Ser
1065 1070 1075 1080

act aca tac agt ttg gaa tct cct aaa gac ccg gta cta cca gct cgt 3437
Thr Thr Tyr Ser Leu Glu Ser Pro Lys Asp Pro Val Leu Pro Ala Arg
1085 1090 1095

ttc ttc act caa cct gac aag aat ttc agt aac acc aaa aat tat ctg 3485
Phe Phe Thr Gln Pro Asp Lys Asn Phe Ser Asn Thr Lys Asn Tyr Leu
1100 1105 1110

cct cct gaa atg aaa tca ttt ttc act cct gga aaa cct aaa aca acc 3533
Pro Pro Glu Met Lys Ser Phe Phe Thr Pro Gly Lys Pro Lys Thr Thr
1115 1120 1125

aat gtt cta gga gct gtt aac aag cca ctt tca tca gca ggc aag caa 3581
Asn Val Leu Gly Ala Val Asn Lys Pro Leu Ser Ser Ala Gly Lys Gln
1130 1135 1140

tct cag acc aaa tca tca cga atg gaa act gta agc aat gca agc agc 3629
Ser Gln Thr Lys Ser Ser Arg Met Glu Thr Val Ser Asn Ala Ser Ser
1145 1150 1155 1160

agc tca aat cca agc tct cct gga aga ata aag ggg agg ctt gat agt 3677
Ser Ser Asn Pro Ser Ser Pro Gly Arg Ile Lys Gly Arg Leu Asp Ser
1165 1170 1175

tct gaa atg gat cac agt gaa aat gaa gat tac aca atg tct tca cct 3725
Ser Glu Met Asp His Ser Glu Asn Glu Asp Tyr Thr Met Ser Ser Pro
1180 1185 1190

ttg ccg ggg aaa aaa agt gac aag aga gac gac tct gat ctt gta agg 3773
Leu Pro Gly Lys Lys Ser Asp Lys Arg Asp Asp Ser Asp Leu Val Arg
1195 1200 1205

tct gaa ttg gag aag cct aga ggc agg aaa aaa acg ccc gtc aca gaa 3821
Ser Glu Leu Glu Lys Pro Arg Gly Arg Lys Lys Thr Pro Val Thr Glu
1210 1215 1220

cag gag gag aaa tta ggt atg gat gac ttg act aag ttg gta cag gaa 3869
Gln Glu Glu Lys Leu Gly Met Asp Asp Leu Thr Lys Leu Val Gln Glu
1225 1230 1235 1240

cag aaa cct aaa ggc agt cag cga agt cgg aaa aga ggc cat acg gct 3917
Gln Lys Pro Lys Gly Ser Gln Arg Ser Arg Lys Arg Gly His Thr Ala
1245 1250 1255

tca gaa tct gat gaa cag cag tgg cct gag gaa aag agg ctc aaa gaa 3965
Ser Glu Ser Asp Glu Gln Gln Trp Pro Glu Glu Lys Arg Leu Lys Glu
1260 1265 1270

gat ata tta gaa aat gaa gat gaa cag aat agt ccg cca aaa aag ggt 4013
Asp Ile Leu Glu Asn Glu Asp Glu Gln Asn Ser Pro Pro Lys Lys Gly
1275 1280 1285

aaa aga ggc cga cca cca aaa cct ctt ggt gga ggt aca cca aaa gaa 4061
Lys Arg Gly Arg Pro Pro Lys Pro Leu Gly Gly Gly Thr Pro Lys Glu
1290 1295 1300

gag cca aca atg aaa act tct aaa aaa gga agc aaa aaa aaa tct gga 4109
Glu Pro Thr Met Lys Thr Ser Lys Lys Gly Ser Lys Lys Lys Ser Gly
1305 1310 1315 1320

cct cca gca cca gag gag gag gaa gaa gaa gaa aga caa agt gga aat 4157
Pro Pro Ala Pro Glu Glu Glu Glu Glu Glu Glu Arg Gln Ser Gly Asn
1325 1330 1335

acg gaa cag aag tcc aaa agc aaa cag cac cga gtg tca agg aga gca 4205
Thr Glu Gln Lys Ser Lys Ser Lys Gln His Arg Val Ser Arg Arg Ala
1340 1345 1350

cag cag aga gca gaa tct cct gaa tct agt gca att gaa tcc aca cag 4253
Gln Gln Arg Ala Glu Ser Pro Glu Ser Ser Ala Ile Glu Ser Thr Gln
1355 1360 1365

tcc aca cca cag aaa gga cga gga aga cca tca aaa acg cca tca cca 4301
Ser Thr Pro Gln Lys Gly Arg Gly Arg Pro Ser Lys Thr Pro Ser Pro
1370 1375 1380

tca caa cca aaa aaa aat gtg taagttgtaa atattacatt tcaaaccaat 4352
Ser Gln Pro Lys Lys Asn Val
1385 1390

ttcaaattat tttgcaaaag ttcctaaatt tgtaaacata catattgctg tatttaaatt 4412
ccatatattt agccccatta cactaggtac ggcggcgaag tgctaaaagg gaacggcgat 4472
gaacaaatgt aattaataac tttctctgtg aaagctttgg aaaaatcttt tttttttttt 4532
tttttttttg gtcaagcttg aggctgaata aagcctttga tgcacaaaat gggactgctg 4592
aagagtggac agttggacct tactttggtg accccataca tttgtggtca catgctttag 4652
ccatacacat ggtaacattg actatggagt cttgtgaaag tgtaatgtgc gatggctatg 4712
tagacataaa gaagaaactt gtaaatatct tttttctttt ttttaatgtt tctgatttct 4772
gaagtgcttg tatagctttt atctgcggct ttaaactgac agtacccgac tgtttattgg 4832
atctattgat ttgaaaagaa tttgttagga tagatcttaa gcagtaatct gtcagtgttt 4892
gtatttgtat tttctgcaat tttactgtga aaaaaaattt gttttcaaca attggtgtca 4952
ttttcttgat gtcactattt gttggagagt taaatggtct cttccctttg tgtatcttac 5012
ctagtgttta ctcctgggca cccttaatct tcagaggtgc taaattgtct gccattacac 5072
cagaaggatg cctctgatag gaggacaacc atgcaaattg tgaaatagtc ctgaagttct 5132
tggattactt tacacctcag tattgatttg tcccagaatt ttctggcctt tcatggcaat 5192
gaaaatttta agaagaaaga tttaaagtat tttaatttta aagagtgtgt tataaaataa 5252
tgtactgaat tctttatccc attttatcat cctttcagtt tttattaatc tactgtatca 5312
ataaaattct gtaatttgaa tgagtaaaaa aaaaaaaaaa aaa 5355



<210> 5
<211> 18
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Consensus Sequence

<400> 5
ctawwagscc csgcscaw 18